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AGGRESSIVE AND WITHDRAWING CHILDREN IN THE 
NORMAL SCHOOL 


I—PATTERNS OF BEHAVIOUR 


By E. A. LUNZER 
(Department of Education, University of Manchester.) 


SumMMARY. Forty-two “a ive” 
aggressive” and forty “wi i 
7 y “withd: ” chi 
by teachers out of 2 tot school population of 1,002 children, are Compared 
c ith a group of “exceptionally n, compared 
Comparisons are based on rating scales, eiat Cul Moe "ieri 
a HS or e and a test of social adaptation Pë 5 NEL cum 
ie results indicate that there a i i 
aggression a withdrawal but that Peel aad Es d Ero problems of 
js comparable. These indications are confirmed iy a limited Re ae e 
-year follow-up. 


I.—Intropucrion. 


Tue work of Wickman (1928) is one of the b 
eee tes hygiene ia m musicae A sip of a et a sb the field of 
st of symp p of maladjustment in what they considered t t ed to rank a 
severity. A similar list was ranked by a group of psychol o wj their order of 
and psychiatric social workers. Teachers gave a great d Sps $, psychiatrists 
violations of authority and other ' aggressive ' forms of bel more empha: is. to 
kinds of behaviour which might be held to oec glen yo while’ those 
ersonality were relegated to a very low position in thei se the shy, recessive 
that these symptoms were not regarded as serious in ‘ieee suggesting 
of Wickman's procedure have been rightly criticised p import. The details 
which need not concern us here. It remains that hig b que of grounds 
any times and the results have been substantiall virago has been 
derable care was taken to avoid the procedural med i6 emi 
The whole of T work has been admirably erena peius 
X ted study by Hollins (1955). This investigati rd a 
comparison between the rankings of Manchester teachers and (i on involved a 
of P.S.Ws. In general, the findings of Wickman were again confi ose of a group 

There is an unstated value judgment implicit in such studi a i 

assumed that the opinions of the clinicians are a correct refl es: it is tacitly 
and that if teachers’ opinions differ from theirs, it is b ection of the facts, 
insufficiently aware of the essential principles of ‘child d cause teachers are 
something more than an argumentum ex auctoritate? Th levelopment. Is this 
hood maladjustment for adult mental health are still và S aplication of child- 
Long-term studies at present being carried out by icine terra incognita. 
no doubt, go far to remedy this deficiency. At present, th ondon Institute will, 
persistent delinquent behaviour in childhood and a dul mpeg pan between 
established than the connection between recessive behavi criminality is better 
dysthymic neuroses in later life. This is not to say that end in childhood and 
about the relation. between aggressive or delinquent are more is known 
criminality than about the implications of recessive sym avioni and adult 
therefore the latter are necessarily less grave in their ie e in children, 
are indeed more serious. Nevertheless, there P e E my P. 
ed for 


that they 
ore factual knowledge. The present investigation was undertaken as t 
in Y a step 
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II.—SAMPLING PROCEDURE. 


Some twenty advanced students, teachers attending the course in Child 
Psychology for the Diploma in Education at Manchester University, were asked 
to put forward one or two names of children in their class or classes who were 
most ‘aggressive’, and one or two, depending on the size of the class, who were 
most ‘withdrawing ', together with an equivalent number of ‘ exceptionally 
well-adjusted ' children. The aim was to obtain two experimental groups who 
might reasonably be described as deviants at the 5 per cent. level—which would 

- correspond in all to 10 per cent. of the population from which they were taken. 
That figure is not disproportionate to the percentages reported in the three 
pilot surveys carried out for the Committee on Maladjusted Children as requiring 
some psychiatric attention. Each of these divided the children in a representa- 
tive cross-section into five categories. Category A were those rated as seriously 
maladjusted, requiring clinical treatment; category B were those who were 
probably maladjusted and should be recommended for psychiatric interview, 
but not necessarily for treatment. The remaining three categories represented 
freedom from psychiatric symptoms, or relatively minor troubles which 
could be countered without recourse to the child guidance service. Overall 


percentages in the first two categories combined were 11-3 in Somerset, 15-8 in 
Birmingham, and 5-4 in Berkshire. 


Teachers were encouraged to nominate more than the minimum for the two 
‘ experimental ' categories if they were especially unfortunate in the class that 
they had, but in point of fact, none of them did so, although there are reasonable 
indications from a consideration of those children who were included in the 
sample that some of them might have done so. On the other hand, several 
teachers put forward fewer names than their statistical allocation, on the 
ground that they had not enough children in their classes who might be regarded 
as even remotely ' unsettled’, and they may well have been justified. 


Since not all the children who were nominated were eventually rated and 
tested in the course of the subsequent investigation, the final samples were a 
little smaller than the original, and differed, albeit slightly, from one comparison 
to another. For most purposes the three samples consisted of forty-five 
oe ' children, forty-two ‘ aggressive ' and forty ‘withdrawing’. Only 
recA p in each group were in secondary modern schools, and of the 
e$t r, about ten in each group were infants and the others were of junior 

E e total population from which these selections were made consisted of 


twenty-eight classes drawn from thi A 
5 , class irteen s s a ar s 
i.e., 1,002 children in all. een schools in and around Manchester, 


TABLE 1 


ind DX OHIBDESR REPRESENTED IN THE THREE SAMPLES. 
(W= Withdrawing. WA=Well-adjusted. A= Aggressive.) 


3 3 9 


x Age-Group wW WA A Total 
5 — 7:55 5 
E D 5 $ - 
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Analysis of variance showed the three samples to be strictly comparable 
in respect of age. 


The teachers who made the selection were free to nominate boys or girls, 
whichever were best fitted to the category for which the selections were made. 
With a single exception, all of the twenty-eight classes concerned were mixed. 
In the event the well-adjusted sample consisted of twenty-one boys and twenty- 
four girls, the withdrawing sample of eighteen boys and twenty-two girls and 
the aggressive sample of thirty-three boys and only nine girls. Sex differences 
in the first two groups are not significant, but in the aggressive group, with 
x°=6-3, the difference is significant at the -02 level. This fits in with the 
observation that more boys than girls are referred to child guidance clinics. 
Since the incidence of recessive behaviour is approximately equal in the two 
sexes, while aggressive behaviour is considerably more prevalent in boys, and 
since referrals by parents, teachers and juvenile courts are more likely to be on 
the grounds of ‘difficult ’ behaviour than excessive shyness, as we know from 
Wickman, Stogdill (1931) and their followers, it is not surprising that referrals 
as a whole, that is including those made by L.E.As. and G.Ps., should number 
more boys than girls. It also suggests that aggressive behaviour is more 


frequent as a whole than recessive—although it tells us nothing as to which is 
more serious. 


Comparisons between the three samples with respect to size of family, 
position in family and whether the mothers were at home or at work all showed 
insignificant differences, although the trends (more youngest and only children 
in the withdrawing sample, more mothers working full-time in the aggressive 
sample) were in line with expectation, 


II.— RATINGS. 


All of the children in each of the three samples were assessed on each ot 
twelve traits. These were selected from a number of available lists* to cover 
a fairly wide range of behaviour with a minimum of overlap. Careful precautions 
were taken to minimise halo effect. All the children in each class concemed 
were ranked on separate occasions for each trait, and the ratings themselves 
were arrived at subsequently by allocating all of these to one of the five categories 
in approximately a normal distribution. It will be seen that children included 
in these samples differ very widely from the normal distribution which character- 
ised the population from which they were drawn. The number of children 
assessed at each of the 5 points on each of the twelve traits is shown in Table 2. 


* Schonell (1944), N.F.E.R. (1952) and Fleming (1954). 
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TABLE 2 
ASSESSMENTS ON RATING SCALES. 
“VRAII-HATINGS ON A FIVE-POINT SCALE AS MADE BY CLASS TEACHERS. 


Aggression : "C 
ive—Compliant/Timid.... 17 
Aggressive p. / ^ £ 7 
15 5 a 
Highly significant 


Concentration : 
Absorbed—Wandering 4 18 12 
18 15 1 
9 10 10 
Not significant. 


Self-Confidence : 
Very confident/cocksure—Over — 5 23 

dependent 7 3 

9 16 5 

Highly significant. 


Persistence in Failure : 
Purposeful—Depressed/Resentful 3 16 12 
23 9 2 
9 8 15 
Significant at -01 level. 


Maintenance of Effort : 
Sustained enthusiasm—Listless/ 8 14 10 
Apathetic 20 11 2 
10 12 17 

Not significant. 


Sensitiveness to Praise and Blame : 
Extremely sensitive—Quite 12 14 4 
indifferent. 6 35 2 
3 15 19 
Highly significant. 


Neat-Handedness : 


Skilful—Clumsy 4 19 10 


18 13 3 
5 19 12 
Not significant. 


Attitude to Adult Authority : 
Highly co-operative—Frequently 4 30 5 
obstructive/Rebellious 14 td — 
9 13 11 
Highly significant. 


Sociability : 


Only happy with group—Prefers 2 10 22 
to be alone 13 27 2 


20 11 8 
Highly significant. 


Initiative : 
Marked originality and drive— — 7 19 
Very dependent/diffident .. 23 13 — 


16 13 8 
Highly significant. 


Response in Class : 
Exceptionally good talker— — 5 16 
Hesitant/Difficult to draw 13 20 2 
out 16 17 4 
Highly significant. 


Conscientiousness : 

High standard of work/Scrupulous 9 19 10 
—tlrresponsible/No ambition 21 15 1 
in work 6 17 14 


Not significant. 
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Differences with respect to ‘aggression’ merely indicate that not all 
children shared symptoms of aggression or withdrawal to the same degree. The 
ratings given in respect of concentration, persistence in failure, and mainten- 
ance of effort are of particular interest, since these may be held to indicate the 
effect of maladjustment without being specific as to its direction. It may be 
said that maladjustment in general is a response to stress characterised by 
anxiety. The nature of the response will vary from one individual to the next 
and, in particular, it will be greatly different in the recessive children from the 
aggressive. But whatever the specific response pattern, there will be inhibition 
of the highest centres of nervous activity, resulting in a loss of concentration, 
in lack of persistence and in lack of enthusiasm for work tasks. 

It appears that more children in the aggressive group show an extreme 
inability to concentrate, but the difference between this group and the with- 
drawing is not significant (y2—6-9, d.f.=4). Differences with respect to 
persistence in failure are significant. Pooling categories 1+2 and 44-5 in a 
3x2 table, x2=10-0 which is significant at the -01 level (figures for the well- 
adjusted group are not included in any calculations, since these are clearly 
significantly different, but reveal no more than the general importance of good 
adjustment in fostering favourable behaviour traits). Again, the distribution of 
the two groups when assessed for maintenance of effort favours the aggressive. 
The disproportion between categories 4 (Only fitful application. Fairly frequent 
rests.) and category 5 (Seems too listless and apathetic to do much) is striking. 
It is in line with what is known of asthenic neuroses but it accords ill with the 
characterisation of introverts as readily conditioned or excitatory. Much 
may depend on the stability of the nervous system and on the level of complexity 
at which the conditioning is demanded. 

Taken as a whole, assessments on the three traits do not warrant more than 
limited conclusions. There appears to be little difference in the incidence of 
more or less anxiety in the two groups insofar as concentration on day-to-day 
tasks is impaired, but the withdrawing group show less determination in the 
face of failure and a greater tendency to fatigue. However, any inferences as to 
the quality of work produced by the two groups will be seen to be not warranted 
by the facts, as is shown in a forthcoming paper (Lunzer, 1960). 

The distribution of ratings on the remaining traits point to the marked 
differences between the two groups of maladjusted children. In several 
instances the aggressive and withdrawing groups are at opposite extremes, with 
the well-adjusted control group midway between them. The withdrawing 
group are significantly more dependent and more sensitive to praise and blame.* 
They show less initiative and less response in class. It would appear that they 
are also less sociable, although this is not confirmed by the index of popularity 
as shown in sociometric choices made by their classmates. Differences with 
respect to conscientiousness and neat-handedness are not significant. 


IV.—Bnisror Guiprs. 


A further indication of the degree of maladjustment present in each of the 
two experimental groups is provided by an inspection of the Bristol Social 
Adjustment Guides. These were completed by the class teacher concerned in 
respect of every child included in the sample. Following on the procedure 
suggested by Stott (1958) each indicator of maladjustment was given a weighting 
of two points while the milder symptoms of ' unsettledness ’ as also the indicators 


* t may be of some significance that the ten withdrawing children rated as extremely 
sensitive, were all among the youngest in the sample. Do these children develop a thicker 


skin as they grow older ? 
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of nervousness and various miscellaneous nervous symptoms were scored as one 
point. The total number of points awarded to each child thus gives some 
indication of the degree of disturbance that is present, as measured by the 
tendency for that child to show more or less considerable deviations from the 


normal in various aspects of his behaviour. The results are shown in the follow- 
ing table: 


TABLE 3 
MALADJUSTMENT SCORE ON BRISTOL GuiDEs. 


It will be seen that 
control group, 
withdrawing. 
ance than the latter, but the diffe: 


nalysis of variance indicated that the degree of 
he predominance of one or other pattern of reaction 


point of view, the Bristol Guides yield a far more 
of personality disturbance than can be had from this 
ethod of scoring and it is of some interest to note the extent 
H t deviations fall into distinct patterns for the two groups. 
ere the hypothesis would be that both groups might be expected to show 
essness, nervousness, depression and miscellaneous nervous 

symptoms, but that the withdrawing group would be characterised by signs of 
unforthcomingness and withdrawal, while the aggressive group would tend to 
show combinations of anxiety for affection or approval on the one hand, and, 
on the other, hostility both in relation to adults and in relation to their own 
trm F The Hictietory was fairly clear-cut in thirty-one out of the forty- 
ihe ee e Ta and twenty-seven out of the forty withdrawing, while 
aggressive and five withà bum c; over the other was marked in a further five 
Seven out of the forty withdraw; 


* A similar clear-cut dichotomy emerges in a recent study by Peterson, et al. (1959). 
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In view of the comparable scoring of the two groups when assessed by means 
of these Guides, they afford some indication of the degree of disturbance present 
in each case. If an arbitrary figure of 16} is taken as indicative of more or less 
serious disturbance, then twenty out of the forty withdrawing cases may be 
said to be ‘maladjusted’ and twenty-four out of the forty-two aggressive. 
Altogether, this would be equivalent to forty-four out of the total population of 
1,002 or 4-4 per cent. It would appear that the incidence of disturbance as a 
whole is approximately equally divided in this sample as between the two 
behaviour patterns. However, in view of the sex differences already noted, it is 
probable that more cases might have been brought to our notice in the aggres- 
sive sample, although, for the most part, these would have been milder. In 
addition, there is an unknown but probably limited number of children whose 
names were deliberately withheld because the disturbance was too ambiguous. 


V.—SocIoMETRIC CHOICE-RANKING. 


An important index of the degree to which the adjustment of a child is 
impaired by anxiety-induced behavioural patterns may be had from an examina- 
tion of his capacity to gain the friendship and approval of his peers. According- 
ly, a sociometric test was carried out by the teachers of the majority of the 
classes concerned (over the age of 7). Three criteria were used, with the first 
three choices weighted 5, 3 and 2. In addition, to allow for an indication of 
' social expansiveness ’ each child was allowed an unlimited number of choices 
' below the line ' and these were given unit weightings. The total score obtained 
by each child was found from the class sociomatrix and all the children in the 
class were then ranked. Scores of children in the three samples are percentile 
rankings and are, therefore, comparable as between classes of different sizes. 
These are shown in Table IV. 


TABLE 4 


SOCIOMETRIC PERCENTILE RANKINGS. 


The differences between the two experimental groups and the control is 
significant beyond the -01 level. The fact that the latter is so high merely 
indicates the fact that teachers, in choosing children as ' exceptionally well- 
adjusted ', tended on the whole to select children who were popular. What is 
more revealing is that, here again, there is no difference whatever as between 
the means for the two experimental groups and the S.Ds. are comparable. The 
fact that emotional adjustment is a factor in sociometric ranking has been 
shown by Dahlke (1953). But it is of some interest to note that the relation is 
so marked that, even within the two experimental groups used in this study, 
representing approximately only the lower 10 per cent. of their population, we 
found correlations of —:443 (aggressive), and —-475 (withdrawing) between 


f It is probable that the arbitrary figure of 16 represents a somewhat greater degree of 
disturbance than Category B in the three surveys carried out on behalf of the Underwood 
Committee, though less than category A. 
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sociometric ranking and maladjustment score on the Bristol Guides. Both of 
these are significant at the -02 level. However, although it might be inferred 
that the two patterns are equally severe insofar as they affect children’s relations 
with one another, it must be admitted that a more detailed study of this aspect 
of behaviour should be more revealing. Are any of these children actively 
rejected? How far are their own choices reciprocated ? To what extent do they 
fit in with the ' in-groups ' of their respective cla 
which must be left to further investigation.* If these findings are compared 
with the ratings given to these same children 


on the trait of sociability (see 
Table 2), we see that now there is a marked difference between the two groups, 


with the aggressive more dependent on company than even the well-adjusted. 
In some cases this might lead them to feel frustrated by the lack of approval 
given them by the class as a whole. More often, one may suppose, they tend 
to form into delinquent or sub-delinquent groups. On the other hand, the 
tendency to be somewhat unsociable or to prefer being alone which appears in 


the withdrawihg group may be presumed to represent their own response to 
anxiety—by avoidance. 


Sses? These are questions 


comparisons are internal t 
have no meaning, but the rel 


TABLE 5 


SOCIAL ADAPTATION. 


Previous research has shown that delinquents tend to have low scores on the 
Doll scale (Dunsden, 1947). a 


} l It was anticipated that this was largely due to the 
high weighting of cognitive i k 


items in the scale and that the effect would dis- 
appear when these items were se 
out, and the relative standi 


items are deducted from the totals. On the other 
that the withdrawing group would be particularly 
an 3s W. = z3 Blyth (1959) has shown that is is 
er cent. of cases, using only ecological It si ll: bable that prediction 

of such choi i d : eems wholly probal p 
ao with ecoleree” taised to a very high level indeed by combining temperamental 


itis D alae oe Was a try-out for the Vineland-Manchester Scale of Social Adaptation 
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low in social independence is fully corroborated. Results on this scale were 
compared when the three groups were sub-divided into three age sub-groups, 
and tested by analysis of variance. The trends were consistent for all ages and 
the differences proved significant well beyond the -01 level. The conclusion 
might well be drawn that if some of these children were encouraged to gain 


greater independence, their adjustment might improve. 


VII.—ONE-YEAR FOLLOW-UP. 


The evidence so far adduced suggests that there are probably more cases 
of maladjustment which take a predominantly aggressive form, but that the 
severity of the handicap may be comparable for the two groups as a. whole, 
although on some counts at any rate the valuation of Wickman’s clinicians is 
borne out. However, for the most part the results are less decisive, and the 
problems as a whole seemed to us sufficiently important to warrant a more 
decisive form of analysis. This can only be done by means of a long-term follow- 
up study. In the final analysis those forms of behaviour which are overcome in 
the course of maturation and experience may be regarded as benign, and those 
which are not merit our more serious attention. Asa preliminary to such an 
investigation, the teachers concerned were asked to complete a second Bristol 
Guide in respect of the majority of the children rated as severely disturbed 
(i.e., those who had been awarded a score of 16 or more in our system of scoring 
the Guides) and at the same time to grade them on a five-point scale as con- 
siderably improved, improved, much the same, somewhat deteriorated, or 
considerably deteriorated, adding any remarks which they thought relevant. 
On the basis of their returns it was possible to grade eleven of the withdrawing 
cases and thirteen of the aggressive on a seven-point scale, as shown in Table 6. 


TABLE 6 


FoLLOW-UP OF RESULTS FOR TWENTY-FOUR CHILDREN. 


Marked _ Slight Slight Marked 
improve- | Improve- | improve- No deteriora-| Deteriora-|deteriora- 
ment ment ment change tion tion tion 


It is of some interest to note that if the pattems of behaviour shown in 
these second returns are compared with the patterns shown one year previously, 
we find some admixture of the anxiety-hostility pattern in two out of the 
eleven withdrawing cases, although the predominance of items remains 
unchanged. Similarly, two out of the thirteen aggressive cases now show some 
signs of withdrawal or ‘ unforthcomingness. In all but one of the remaining 
cases the pattern remains the same even though the number and severity of 
symptoms may be reduced. In only one case is there a virtual reversal of 

atten. These trends tend to confirm the theoretical presupposition underlying 
the initial selection. First indications would be that spontaneous improvement 
occurs in something over half of all cases, in both groups, but that the change 
may be more spectacular in the aggressive, especially those who, thoug 
markedly disturbed, are not seriously maladjusted. However, these conclusions 
are very tentative and it is hoped to focus our attention on this aspect in future 


investigations. 
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SYMPOSIUM: THE DEVELOPMENT OF MORAL VALUES 
IN CHILDREN 


VII—THE CONTRIBUTION OF LEARNING THEORY. 


By H. J. EYSENCK 
(Institute of Psychiatry, University of London.) 


Summary. It is argued that moral values are learnt in the course of the child’s 
development, and that consequently any theory regarding their development 
should be based on the known facts and principles of modern learning theory. 
An attempt has been made to do so by suggesting that ‘conscience’ is, in fact, 
a conditioned response built up during the child’s formative years by the pairing 
of conditioned stimuli (arising from aggressive, predatory, and overtly sexual 
actions) and unconditioned stimuli (slaps, beatings, ‘ shaming’, and other 
punishments) immediately following the conditioned stimulus. Aided by a 
process of stimulus generalization this should, in course of time, lead to an 
association between the conditioned stimulus and the fear-anxiety responses 
appropriate to the unconditioned stimulus. Certain deductions are made from 
this hypothesis and are shown to be supported by experimental evidence. It is 
argued in particular that if the hypothesis is correct, then individual differences 
in conditionability, i.e., the ease with which conditioned responses are formed, 
should be related to moral behaviour and the ease with which guilt feelings 
are aroused. Here also evidence is quoted to suggest that this hypothesis has 
some foundation in fact. It is concluded that the treatment which children, 
adolescents, and adults receive in response to immoral acts, should be based 
upon the recognition of individual differences between them, particularly with 
respect to their conditionability, as only in this way will the most beneficial 
results of such treatment be realized. 


L.—PRELIMINARY CONSIDERATIONS. 


THERE is little doubt that attitudes, interests and values are acquired through 
some process of learning or conditioning during the course of the individual's 
development, and it seems almost certain from the great deal of evidence 
accumulated by psychologists, and reviewed by earlier contributors to this 
symposium, that much of this process of learning takes place during childhood 
and possibly adolescence. If it is agreed that responses indicative of moral 
values, whether verbal or behavioural, are indeed, learned responses, then it 
would seem that modern learning theory, which is probably the most advanced 
part of psychology, should have a contribution to make to our understanding 
of their development. In what follows I have tried briefly to indicate the 
direction which such a contribution might take, and also to quote some experi- 
mental data which seem to support the position taken here. 

There is an obvious dualism involved in talking about ' moral values’. 
We may be concerned with the knowledge of existing values in a society, and 
this can be verbally expressed in a questionnaire or an interview ; this know- 
ledge is not necessarily related to conduct although there does appear to be a 
slight tendency for such correlations to exist. Nevertheless, the delinquent 
child as well as the criminal adult is usually only too well aware of the fact that 
his conduct is contrary to moral precept ; his evil-doing is not by and large due 


to ignorance. 
8 


II 


I2 Learning Theory and Moral Values in Children 


i of defining the moral values of a person deals with 
i E pompes his iionDene and it would seem that this 
pedes hi i robably more fundamental and more fruitful than the other. Here 
ac IN oats we have a choice to make. We can deal with the matter at the 
pe het le and naturalistic level, i.e., by studying actual am en m 
has been done for instance by Ackerson (1942), or by Hewitt and Jenkins (us : ). 
Alternatively, we can make use of the experimental method, and dp e ru 
quent behaviour in the laboratory, in the classroom, or in strictly cop rolle 
conditions, as was done for instance in the famous Character ET e: nepisg, 
In either case we discover that among children, adolescents and adu a here 
appears to be a range of delinquent behaviour from the poreon W o zoer 
commits a delinquent or dishonest act, to the person who almos vana ly 
does so. The task of the learning theorist is to account for this gradation in 
behaviour along the well established lines of learning theory. 


i i i i t the funda- 

Many people interested in the social consequences of crime pu 
mental question in the following way. They ask : How is it that some people 
commit crimes when they know perfectly well that their acts are anti-soci , 


predatory, is surely the natural Way to act for animals 
the question is: ^ 


with the dictates of Society, however inconvenient and distressing these dictates 
may be, and however much they may conflict 
needs and drives? ” 


It is often Suggested that people refrain from immoral acts because of the 
fear of punishment. It is unlikely that this h 


an increase in drive usually leads t. 


IL—Tur pasts OF ‘ CONSCIENCE,’ 


These and many other reasons make it unlikely that the threat of punish- 
ment is the only or even the main reason for moral behaviour. The alternative 
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suggested by many writers has been a kind of interiorised policeman variously 
named conscience, ‘inner light,’ or super-ego.* This is usually conceived as 
some kind of dews ex machina implanted in the human being in some mysterious 
way, which ceaselessly keeps an eye on his activites, and gives him a sharp 
tweak, whenever he deviates from the straight and narrow path of duty. 
Descriptively, this is probably not entirely an inaccurate account; the difficulty 
with it is that there is no known mechanism by means of which such an inner 
policeman could be called into being, and that the description does not give us 
any clues about the reasons why some people have a strong and tender conscience 
while others seem to be completely lacking in it. It is sometimes suggested that 
environment and teaching are responsible for the apparent individual differences, 
or that there might be an intellectual deficit which makes some people fail to 
respond to moral teaching. There is, of course, no doubt that environmental 
pressures play an important part in the growth of moral ideas and conduct, but 
such a hypothesis would not account for the frequently observed fact that even 
the best environment often produces psychopathic individuals apparently 
lacking completely in any ‘ inner guiding light,’ while the very poorest environ- 
ment does not by any means invariably produce criminals. Similarly, there is 
not very much relationship between lack of intelligence and criminality ; what 
pua shes is is more likely between low intelligence and likelihood of being 
ound out ! 


The suggestion made here is a relatively simple one, namely, that conscience 
is a conditioned anxiety response to certain types of situations and actions. In 
the typical Pavlovian experiment, the dog, through simple pairing of conditioned 
stimulus (bell) and unconditioned stimulus (meat powder) learns to salivate to 
the bell, whereas previously it only salivated to the meat powder. Everyone is 
familiar with this experimental paradigm, and with the fact that similar 
conditioned responses, particularly of the autonomic nervous system, can be 
quite easily produced in human beings as well. There is ample evidence to 
show that anxiety is a conditioned fear response attached to a previously neutral 
stimulus. Watson’s famous experiment with little Albert is probably too well 
known to need extensive retelling ; he induced a phobia for furry animals in an 
11-month old infant ‘ Albert,’ who previously had been fond of such animals, 
by banging a metal bar with a hammer behind Albert’s head whenever the 
infant reached out to pat a white rat (Watson and Raynor, 1920). Learning 
theory has formalised the rules according to which this conditioning takes place, 
but we need not be concerned with anything but the bare fact that anxiety and 
fear responses can be conditioned in human beings with very great ease. It is also 
to be noted that such conditioned responses, once they are formally established, 
do not extinguish by themselves in the course of time, but require an experi- 
mental process of extinction (Osgood, 1953.). 


* The Freudian conception is concisely given in Freud’s “ General Introduction ” : 
“ It is not to be doubted for a moment that one may recognize in the Oedipus-complex 
one of the most important sources for the consciousness of guilt with which neurotics are 
so often harrassed . . . Perhaps mankind as a whole has, at the beginning of its history, come 
by its consciousness of guilt, the final source of religion and morality, through the Oedipus 
complex." Perhaps. Even if the first part of this quotation had some factual reference, 
it would still not be clear how a phenomenon assumed to be of universal occurrence, like the 
Odeipus complex, could be used to account for individual differences in proneness to guilt 
feelings. Nor is it at all clear how such a theory could be experimentally or even observa- 
tionally tested. Possibly one might deduce that boys brought up by widowers whose wives 
died in childbirth should all develop into psychopaths. At least they should stand in little 


danger of developing neurotic guilt feelings ! 


I4 Learning T. heory and Moral Values in Children 


The application of this well-documented process to the development of 
moral behaviour can be briefly indicated in the following way. A young child 
behaves in a socially undesirable manner, i.e., by being aggressive, by indulging 
in overt sexual activity, by stealing, lying and cheating, or in whatever way 
anti-social behaviour is defined in a given society. There is an immediate sharp 
punishment—a slap, withdrawal of some privilege, ‘shaming’, exclusion from the 
family circle or whatever it may be. This punishment produces pain and fear 
and the associated autonomic disturbances ; these in turn become attached to 
the type of situation and the type of action which called forth the punishment, 
thus producing a conditioned anxiety reaction whenever similar situations and 
actions re-occur. . f 
It might be argued that surely identical situations never occur, but this 
ction is taken care of by the well-known fact of stimulus generalisation, 
Little Albert was taught to be afraid of a rat, but this conditioned fear general- 
ised to rabbits and other furry animals. Stimulus generalisation, therefore, 
will account for the fact that not only identical but similar situations and 


-G.R. response to the word ‘ cow’, 


lectric shock, will now have a 
conditioned P.G.R. response also to the names of other animals, such as * goat ’; 


r, generalise responses to words such as ‘ how’ which are, 
apparently, much more similar to the original conditioned stimulus from the 
point of view of sound. 
We thus have a growing child in whom Conditioned anxiety responses have 
been built up to anti-social behaviour in situations involving aggression, sex, 
etc. When temptation arises, there also arises the conditioned anxiety, and we, 
consequently, have some form of hedonic calculus involving not present satisfac- 
tion and future punishment (in which present satisfaction would almost certainly 
win), but rather present satisfaction in Opposition to present discomfort. as 
produced by the anxiety reaction in Tesponse to overt anti-social behaviour. 
If the Conditioning has been strong enough, ‘ conscience ' will win the day, and 
the individual will withdraw from the situations without giving way to ‘ tempta- 
tion.” Even if he does give way to ‘ temptation ’ there will still be a strong 
anxiety reaction to detract from his enjoyment, thus making it less likely for 
him to react anti-socially on the next o ion. 

It might be queried whether anxiety is in effect strong enough for this 
Proposed role. When we consider that many people have committed suicide 
rather than suffer strong anxiety, and when we consider that criminals have, 


However, such facts, of course 
question ; they merely illustrate the Strength w 


IIL—INpiviDvAL DIFFERENCES, 
It is well-known that there is a group of people, not sharply segregated 
from the rest, but presumably continuous with the remainder of humanity, 
which is characterised by anti-social behaviour in almost pure culture. I am 
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referring, of course, to the so-called psychopaths, i.e., to children, adolescents and 
adults, who, in spite of often high intelligence and good up-bringing, seem to 
be completely lacking in moral sense—so much so that they have even been 
called moral imbeciles. They will lie, cheat, steal, rape, and indulge in any form 
of anti-social activity without apparent regard for consequence, and without 
regard for their victims. Such people will often commit criminal acts for the 
slightest of gains, and in situations where discovery and severe punishment are 
practically certain. Here is a group of people for whose behaviour no adequate 
theory has been put forward. Can our theory do better? 

Let us note, first of all, that Pavlov already discovered very great individual 
differences in respect of conditionability between his dogs. Some dogs are very 
easy to condition, others very difficult, with the remainder in-between. This 
finding has been universally verified in the case of other animals, and also in the 
case of human beings. Differences in conditionability are by and large unrelated 
to such factors as age, sex and intelligence. If our theory is accurate, then we 
might be able to explain the behaviour of psychopaths by postulating that such 
people are endowed with a nervous system which is largely resistant to the 
formation of conditioned responses. If this were so, then the course of up- 
bringing would not suffice to produce in them the requisite anxiety responses, 
which we have postulated to lie as the basis of ‘ conscience.’ It should be easy 
to test this hypothesis by attempting to condition psychopaths in the laboratory, 
and by comparing their success or failure with the responses made by an 
average group of people put in the same experimental situation. When this is 
done, the results very strongly bear out the hypothesis, and it is found that 
compared with a normal group of people, psychopaths are, indeed, very difficult 
to condition, requiring many more pairings of conditioned and unconditioned 
stimulus before any effect is seen (Lykken 1957, Eysenck, 1957). Results, 
therefore, are in accordance with our hypothesis 

We might put forward another hypothesis to complement the previous one. 
There is a group of people in society who are suffering from unduly strong 
anxieties related to stimuli which, in the normal person, do not arouse anxieties 
atall. These people suffer from fears of open spaces, fears of enclosed spaces, 
fears of animals, and so forth, without being able to give any rational account of 
their fears. Such people constitute the main group of neurotics, whether in- 
patients or out-patients, at our hospitals and clinics ; they are sometimes known 
as ‘anxiety states.’ Such people also frequently show strong guilt feelings 
about actions which few people would regard as immoral or anti-social in any 
real sense of those terms. In other words, these people appear to have a 
conscience much more tender than the average person. In terms of our hypo- 
thesis we would expect such people to be particuarly easy to condition ; their 
symptoms and their behaviour could then be accounted ior in terms of a too 
ready conditioning of fear responses to a large number of previously neutral 
conditioned stimuli (Cf. ftn. p. 000). 

Again, the evidence strongly supports this view. The work of Spence (1 956) 
Franks (1956, 1957), and many others (cf. Eysenck, 1957) shows that people 
suffering from anxiety states are more easily conditionable than the average run 
of people, and accordingly, we have a complementary piece of evidence showing 
that we can arrange human beings in a continuum of ' conditionability ' from 
high to low, a continuum which runs parallel with a continuum of behaviour 
patterns going from hyper-moral through average to psychopathic and immor al. 
"There is more evidence in favour of this hypothesis, and some of it has been 
quoted elsewhere (Eysenck, 1957). Instead of going into this additional evidence 
I would prefer to draw attention to another extension of this general scheme. 
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Ihave argued that the personality dimension, at the one extreme of which 
we have the psychopath, and at the other extreme of which we have the anxiety 
state, can be found not only in the emotional, anxious and neurotic type of 
personality, but also in the population as a whole, and I have suggested that we 
are, in fact, here dealing with the well-known extravert-introvert typology 
transferred into the field of neurosis and maladaptation. Figure 1 will illustrate 
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this notion ; it is quoted from “ The Structure of Human Personality ” (Eysenck 
1960) and it is a diagrammatical presentation of a factor analysis of fifty traits 
which had been correlated by Ackerson (1942) among several thousand children 
who had been studied at the Illinois Institution for Juvenile Research. It will 
be seen that of the items included, all have correlations with the general factor 
of neuroticism or emotional instability, but that there is a division along the 
lines of the second factor into personality problems characteristic of the introvert, 
and conduct problems characteristic of the extravert. 


Much the same kind of result was found by Hewitt and Jenkins (1946) in 
a study of 500 problem children. When they correlated various notations from 
the case histories of these children, they found three main characteristics, 
ranging from ‘unsocialized aggression’ (the extraverted end) through 
“socialized delinquency ’ to ‘ over-inhibited behaviour’ (the introverted end). 
Other data are quoted in “ The Structure of Human Personality,” to support 
the existence of a general dimension of this nature. There is no direct evidence 
of a relationship between these behaviour patterns in children and their con- 
ditionability ; here would seem to be an area of research which could, with 
advantage, be pursued by those responsible for the moral welfare of children 
and for their disposal after they have come in contact with the law.* 


Among adults, however, there is considerable evidence in the work of 
Franks (1956, 1957), and others (Eysenck, 1957) to show that there does exist 
a significant relationship between introversion and conditionability, even when 
extreme cases of psychopathy are not included in the neurotic sample, and even 
when quite ordinary normal subjects are being tested. In so far as our hypo- 
thesis deals with conduct and patterns of observable behaviour it must, I think, 
be concluded that we are on relatively safe ground in putting forward the 
hypothesis that differences in conditionability determine in part the socialized 
or anti-social behaviour of children and adults alike. 


IV.—IMPLICATIONS OF THE THEORY. 

Does the theory have anything to say about the more verbalized type of 
values and attitudes? An attempt has been made in The Psychology of Politics 
(Eysenck, 1954) to make certain deductions from the theory, and to test these 
objectively. It would seem to follow from the theory developed here that 
introverts would be more concerned with ethical and moral prohibitions, with 
religious ideas and quite generally with the erection of barriers against the 
direct and immediate satisfaction of instinctual and libidinal impulses. Con- 
versely, it would seem to follow that extraverts would be more likely to favour 
the direct expression of sexual, aggressive, and other anti-social impulses and 
to be less concerned with ethical, moral and religious ideas. The proof of this 


* The reader may wonder why both personality problems and conduct problems have 
such high loadings on neuroticism, and whether adult criminals too would be found to be 
neurotic. It would appear from some recent work done with the Maudsley Personality 
Inventory on recidivist criminals, that their neuroticism is indeed almost as high as that 
of hospitalized neurotics. Some learning theorists, particularly Spence, regard neuroticism 
or emotionality as a kind of drive variable (D), which according to Hull's theory, is multi- 

lied by habit (sR) to produce behaviour (sër). If we regard, in a very rough-and- 
ready fashion, the extravert as a person in whom 'temptation'z' conscience’, whereas in 
the introvert ' temptation "E conscience,’ then the added drive produced by the emotion- 
ality-neuroticism variable might be thought to produce a much stronger reaction in the 
direction determined by the respective weight of these two influences. This is à highly 
speculative consideration and it would require a considerable amount of experimental work 
to decide if any real meaning is attached to it. 
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hypothesis requires two series of studies. In the first series, a number of correla- 
tion studies were carried out on the attitudes of quite large numbers of subjects 
towards a number of variegated social issues. When these attitudes were 
inter-correlated, they invariably gave rise to two main factors, the ever present 
one of radicalism-conservatism, which does not concern us here, and a second 
factor, independent of the first, the two poles of which were entitled tough- 
mindedness and tender-mindedness. The factor structure is illustrated in 
Figure 2, and it will be seen that tender-mindedness is a factor which appears to 
be characterised by those attitudes theoretically described as introverted, 
while tough-mindedness is made up of attitudes theoretically attributable to 
the extravert. Thus, a tough-minded person favours the overt expression of 
aggression (by flogging, death penalty, birching, etc.), towards out-groups 
(criminals, Jews, coloured people, etc.), and the overt indulgence in sexual 
activities (companionate marriage, easier divorce laws, the abolition of abortion 
laws). The tender-minded person has strongly favourable ideas towards 
religion and ethical ideals such as pacifism, etc. The first step in our proof, 
therefore, does not seem to contradict the hypothesis (Eysenck, 1954). 
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random sample of the population (Eysenck, 1960). The results were fully in 
line with those previously reported and seem to leave little doubt about the 
existence of some such relationship as that postulated. It would appear, 
therefore, that in the field of verbally expressed moral attitudes and ideas also 
the general hypothesis here put forward can make possible verifiable predic- 
tions. 

It will, of course, be obvious to the reader that there is no intention, in 
sketching out this theory, to account for all criminal or immoral behaviour. 
The nature of such behaviour, and the manifold determining causes which are 
involved are too complex for any single factor theory to be acceptable to 
students of the field. It is obvious that individual differences in condition- 
ability are important in deciding whether a particular conditioned response will 
or will not be established ; it is equally obvious that for such a response to be 
established, even in the most easily conditioned person, there must first of all 
occur a process of conditioning, i.e., of pairing the conditioned and the uncon- 
ditioned stimulus. Thus, in conditioning we are obviously involved in what one 
vaguely calls the social factors of character building, and it is, clear that these 
differ from person to person, and probably quite generally from class to class. 
Thus, there is some evidence from American researches, particularly the work 
of Kinsey (1948) and the Chicago school, that lower-class groups lay far less 
emphasis on the extinction of overt aggressive and sexual reactions than do 
middle-class groups. In these circumstances we would expect quite a different 
pattern of behaviour in middle and working class boys, even if there were no 
differences in conditionability between them, and these differences are, indeed, 
found. (Cf. also Eysenck, 1951.) 

Another point where environmental influences are clearly of the greatest 
importance is that relating to the strength of a given temptation which has to 
be overcome by the conditioned anxiety response we call ‘conscience.’ If, 
indeed, a person’s reactions are determined by the respective strength of 
' conscience ' and ‘ temptation,’ then clearly the same person may act morally 
on one occasion and immorally on another, depending on the degree of desire, 
hunger, anger, or whatever may be involved in the situation. A poor man, as 
has often been pointed out, is more likely to steal a loaf of bread than a rich 
man, because the temptation for him is so very much stronger; from comparing 
the actions of the rich and the poor in respect of stealing a loaf of bread, 
obviously no deductions can be made as to their likelihood to form conditioned 
responses easily. All these considerations will be obvious, but I have mentioned 
them briefly to avoid criticisms sometimes made of theories of this kind, to the 
effect that they do not take into account all the conditions of a complex social 
phenomenon. 

Nevertheless, our theory demands that at least to some extent we should 
recognize, and be prepared to discover, innate biological factors determining in 
some degree the moral or immoral, criminal or non-criminal reactions of human 
beings to certain types of situations. Conditionability, presumably, is a func- 
tion of certain features of the central nervous system, and as such is likely to 
owe much to heredity. Is there any evidence about the hereditary determina- 
tion of crime? The important and, indeed, fundamental work of Lange (1931), 
Stumpfl (1936), and Kranz (1936) has shown that such hereditary determination 
can, indeed, be demonstrated very clearly. As is well-known, these investigators 
located prisoners who had a like-sexed fraternal or identical twin. They then 
investigated this twin to see whether or not he also had been convicted of a 
crime, and if so, whether his crime was similar to that of his sibling. They 
found a very considerable degree of concordance for identical twins, and 
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much less for fraternal twins ; it is difficult to interpret the evidence in any 
other way, but as a strong confirmation of the heredity hypothesis. This 
conclusion is well expressed in the title which Lange gave to his book : Crime as 
Destiny. It may be noted that J. B. S. Haldane wrote an introduction to this 
book in its English translation, in which he fully concurred with the author’s 
conclusions ; this is relevant and important because of his expert knowledge 
in the field in genetics and because of his well-known political sympathies 
which would make him less likely to accept such conclusions with enthusiasm. 

Do any practical suggestions follow from the theory here developed? 
Two points seem to be worth making. In the first place, there has been far too 
little work of an experimental nature directly related to this problem to make it 
possible to decide with any degree of conviction as to the adequacy of the 
theory here presented. What is needed isa decade of concentrated experimenta- 
tion before we shall be able to come to a conclusion with any degree of confidence. 
It would, therefore, appear to be premature to make any detailed practical 
suggestions; these must await further qualification of theoretical issues and 
further experimental verification. 

In the second place, however, it does seem to me that if there is any degree 
of truth at all in the theory here presented, then it does lend some support to 
the slogan that punishment should not fit the crime but the criminal. Perhaps 
we might amplify this a little and say that character education cannot and 
should not be a uniform process, but that it should take into account the 
individual personalities involved. This saying, of course, is a truism which 
has often been repeated ; what I am suggesting is that it can only cease to bea 
truism, and become a guide to action, when we know what are the parameters 
directly involved in moral actions, and how these parameters can be measured, 
and if possible, affected by our actions. If it were true that conditionability 
plays as important a role as I have suggested, then it would seem to follow that 
the maxim “ spare the rod, spoil the child " could with advantage, be applied 
to the extraverted, possibly psychopathic, non-conditioner, whereas the 
modern free-and-easy methods of up-bringing would be much more appropriate 
to the introverted, anxious, easy-to-condition type of child. It may be possible 
that the acceptance of some such general rule might reduce the number of both 
behaviour problems and personal problems, which at the moment, appear to be 
ea to such an alarming extent. However, on this point also much 
urther research, possibly of an applied nature and carried out in the classroom, 


Will be required before we can say for certai i ion is li 
ertain whether this suggestion is likel 
to have the consequences vised, x = ” 
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GOAL-SETTING BEHAVIOUR, ANXIETY AND SCHOOL 
STREAMING 


By THELMA VENESS 
(Birkbeck College, London) 
Summary. A particular pattern of 


tion test was found to occur more 
“A” streams of modern schools tha: 


goal-setting behaviour in a level of aspira- 
frequently in technical schools and in the 
n in the lower modern school streams and 
in grammar schools. In previous research, this pattern, of setting high goals 
and adhering to them rigidly, whatever the fluctuations in performance, has 
been associated with anxiety and similar personality traits. Matched pairs of 
these children and controls were compared for other possible indices of anxiety 
in the material of an investigation of which this is a part. The findings are 


discussed in relation to the possible effects of success and failure involved in 
secondary school selection. 


I.—INTRODUCTION. 


THE work to be described forms a small part of an enquiry into the aims and 
aspirations of school-leavers: their vocational and other ambitions; their 
ideas, values and plans concerning the future. In this enquiry, directed by 
Professor C. A. Mace and assisted financially by a grant from the Nuffield 
Foundation, several techniques of enquiry were directed at young people at 


different types of school, all expecting to leave school in about twelve to eighteen 
months. Among the techniques used y 


: n vas a level of aspiration test. The 
essential requirement of such a test is a task which.can be repeated several 
times so that the subject, on the basis of his achievement on any trial, can set 
his goal for the forthcoming one. The conditions of the present investigation 
demanded a task suitable for administration to whole classes of children, so 
that each child could, without hel 


of the trial and record his aspir: 


S of goal-setting for each child. A particular 
r is the subject of this paper. 


IL—TuE CRITICAL FINDINGS. 
From the performance and aspiration scores so obtained, several numerical 
relationships can be calculated and used as indices for each subject. Of 
importance for the Present enquiry are two: 

_ (a) The goal discrepancy Score: the mean of all the differences between 
aspiration level and the immediately preceding performance score. In using 


22 
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this index, it is assumed that the subject decides his next goal on the basis of his 
present score ; i.e., that this index represents numerically the subject's liability 
to aim at targets above or below his likely attainment . 


(b) A measure of rigidity of goal-setting, or the tendency to remain at the 
same aspiration level regardless of fluctuations in actual performance. Here, a 
child was classified as ‘ rigid’ if not more than two different numbers were used 
in stating the ten goals. Since serial order was disregarded for this purpose, the 
following pattern, 10, 9, 9, 10, 10, 9, 10, 9, 9, 9, would qualify for inclusion. 
In no case so Classified was the actual performance so inflexible as to justify 
the rigidity of goal-setting. 

Of particular interest is the combination of high scores in both these 
measures, a pattern of setting a goal above likely performance and of then 
adhering closely to that goal throughout the test. In terms of the present data, 
this pattern represents the concomitance of the rigidity measure already 
described and a mean goal discrepancy score above the median of the total 
sample (4-1:0). Of the 778 children tested, in schools of one local education 
authority (a Home County), forty-four children gave this pattern of responses— 
a proportion of 5-6 per cent. Several had a very high goal discrepancy score 7 
as high as eleven in one case: thirty-eight were in the upper quartile for the 
whole sample. Eleven children made no change at all in their choice of goal. 


This goal-setting pattern, although uncommon in the present sample, 
merits attention because of its resemblance to those described by other investi- 
gators and associated by them with certain personality variables. P. S. Sears,® 
working with younger children, found that such goal-setters had lower ratings 
on self-confidence, made lower estimates of their performance on various school 
activities though this was associated with a higher wish for achievement, and 
on a questionnaire of emotional stability were more likely to indicate anxiety, 
tension and poor school adjustment. She describes children in this group thus: 
“ In general, they are not very well adjusted socially ; are tense, apprehensive, 
sensitive, insecure, worried and usually seem to utilize persistent non-adjustive 
forms of reaction to problems of their school and social life."$ Eysenck and 
Himmelweit! studied a similar pattern in adults and found it to belong largely 
to‘ the affective group,’ a term refering to a syndrome, later called ‘ dysthymia,’* 
the main characteristic of which is anxiety. D. R. Miller? described the neuras- 
thenics in his test population as being more likely, when threatened with 
failure, to predict high scores for themselves, and scores which do not change in 
relation to the changing situation. ' Neurasthenics ’ are here defined by vegeta- 
tive symptoms such as sweating, tremor, diarrhoea—symptoms usually associa- 


ted with anxiety. 

This pattern of goal-setting behaviour would seem, then, to be slightly 
pathological. The fact that only 5-6 per cent. of the sample show it is a welcome 
finding. Of particular interest, however, is the further finding that the forty- 
four children in this group do not represent equally the types of school and 
school stream tested. 

The first difference to be noted is between the proportions of what we might, 
for brevity, call ' anxious’ goal-setters found in the A streams of the seven 
modern schools and those in the lower streams of the same schools. A higher 
proportion of such children were found in the A classes, to a degree significant 
beyond the | per cent. level, using Brown's modification of the Chi Square 


test? and correcting for discontinuity. There was no discrimination according to 


age, sex or social class. 
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The first explanation of this difference that comes to mind relates it to 
intelligence, the more intelligent children being readier to adopt high, un- 
attainable goals and maintain them with tenacity. This hypothesis is not 
supported, however, when the grammar and technical school results are 
examined. The proportion is even higher in the technical schools whereas it is 
sand modern A streams combined 
have a higher proportion, significant beyond the 1 per cent. level, than have 
grammar schools and the rest of the modern streams combined. The relevant 


proportions are shown in Table |. 
TABLE 1 


PROPORTIONS OF ‘ ANXIOUS ' SCORERS. 


Anxious Scorers 
% 


School Stream D No. 


4-00 
Technical .... PN » 9-60 
Modern A ls A 8-95 

2-38 


. In seeking further explanations, and havin 
with anxiety, one is tempted to speculate about th 
finding themselves in certain types o 
results largely from an entrance exa 
to anxiety in parents, anxiety which might wel 
he report on Secondary School Selection 
l Society?, which refers also to cases, 
alleged to be associated with the selection exam- 


Te. The feeli f fail n 
be stronger with the tsi] que. le feelings of fai ure would, therefore, 


he very places Where we fou 


ne c i&&estions do not accord with comments in 
A 1t 1S argued that despondenc among grammar 
school d has been exaggerated ; that it is grammar aol dien who 
are more liable to be anxious and worried by the demands that the curriculum 

: er hand, Sears, 8 claimed that among the 
group of children she tested, those with the pattern of goal-setting ander dis- 
cussion were usually the children as having a past record of 
referred to the type of task 
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entitled ‘The Best Moment of my Life.’ A considerable proportion (14 per 
cent. of boys, 13 per cent. of girls) in the grammar school, sometimes seven 
ears after the event, wrote of the day when they had heard of their success in 
the ' scholarship.' Unfortunately, the complementary evidence is not available, 
i.e., the opinions of the other children on ‘ The Worst Moment of my Life ' ! 
These speculations cannot be pressed further at this point: unlike manu- 
facturers of cereals, we would need a great deal of hard evidence concerning the 
causes and effects of success at the selection examination. Further, it might 
well be suggested that the evidence we have cited from the level of aspiration 
test is a very slender foundation for a speculative superstructure attempting 
to explain anxiety in school children. The first task is to study these forty-four 
children further to see whether one is warranted in regarding them as anxious. 
Opportunities for a close examination of the individual children were not avail- 
able. The material at hand, therefore, i.e., the material collected in the main 
investigation, had to be scrutinised for further clues, and this scrutiny com- 


prises the remainder of this paper. 
III.—PLAN OF FURTHER INVESTIGATION. 


To make more rigorous the search for further signs of anxiety, the forty- 
four children were compared on various indices with a control group of children 
matched, child for child, in several respects. The matched pairs came from the 
same class; this controlled for level of schooling and for sex, and adequately 
for age and general level of ability. They were matched, also, on certain level 
of aspiration measures. Thus, every control child had a positive goal discrep- 
ancy like its pair. As far as possible a child was chosen with a goal discrepancy 
score approximately equal to that of the child to be matched, but the ‘ anxious’ 
group tended to be so high in their goals that a precise match was not possible. 
The total goal discrepancy score for the critical group was 131 and for the 
control group, 100. They were further matched on another level of aspiration 
measure not yet mentioned, which might be called a measure of reality. It is 
an index derived from the supposition that experience of ‘ failure ’ (that is, 
falling short of the target adopted) should not lead to a raising of one’s goals, 
nor an experience of success to a lowering of one’s goals. It is not necessary 
to describe in detail here how the index was calculated, except to say that it was 
derived in such a way as to be independent of the measure of rigidity which 
supplies the crucial difference between the groups. An alternative estimate of 
this rigidity can be calculated from the total number of shifts in goal made. 
On this measure, the rigid group has a total of seventy-six shifts and the control 
group 216. . . 

In the next section are listed seventeen specific items on which the two 
groups were compared, together with the prediction made and the results found. 
A methodological point to be noted here is that, for many of these items, the 
individual child's score had already been assessed, independently of the present 
investigation. Where this was not the case and a new assessment was needed 
for testing a specific hypothesis, the two groups were put together so that the 
investigator did not know which were the ' anxious ' children. 

Before discussing the specific items, the five techniques of enquiry from 
which they were drawn must be described. They are as follows : 
A.—An essay in which the children were asked to imagine tha 
near the end of their lives and tell their life stories from the time they left 
school Time checks were given after 30 minutes to ensure that during the last 
ten minutes they would give attention to the end of the story. Items 1 to 7 


refer to this material. 


t they were 
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B.—A ten minute essay entitled “ The Best Moment of my Life.” Items 
8 and 9 are taken from this. 

C.—A questionnaire in the form of twenty-two statements bearing on values 
related to jobs. Each statement was first marked as Important or Not Import- 
ant in one of two columns. The children then marked in two further columns 
those that were Very Important to them, and then the two that were Most 
Important. A weighted score for each statement is derived. Items 10 and 11 
refer to two of these statements, 

—A questionnaire of the usual form from which come items 12 and 13. 
E.—An individual interview from which items 14 to 17 are taken, 


IV.—Rzsurrs. 
The seventeen items and the actual finding on each are as follows : 
(1) Balken and Masserman,1 comparing the lengths of Thematic Apper- 


ception stories given by neurotics of three kinds, found that those of people 
classified as anxiety cases were the shortest. It was predicted here that the 


but no significant difference was found using the Wilcoxon Test : in fact, in 
only twenty-three out of forty-two Pairs did the ‘anxious’ child write a 
shorter essay. (Occasional absence from school for parts of the investigation 


“anxious ' children being predicted, Again, the 
by the Wilcoxon Test: nineteen out E: for 


operate in this situation from one in whi 
there is a known time limit. 


(3) It was Predicted that, in the tellin 


ife et - $ 
would be more likely to refer to the Savings s. iy theory. ütisinus iq 


i i mentioned by nine of the fort -four 
anxious ' children compared with eight controls, a difference not statistically 
significant 
(4) 


: t os DH » that more frequent references would 
occur in the ' anxious group. Again, the di 


being nineteen ‘ anxious? children and eightee 
(5) A further tally from the li i 
accidents and disasters; The prediction h : ious’ 
again guided by their clinging to security 
anxiety, would be less likely to describe 


e 
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stories. Here, the results are in the predicted direction—eleven ‘anxious’ 
versussixteen controls out of forty-four—but do not reach statistical significance. 


(6) The essays had also been scored previously for evidence of conflict 
such as uncertainty about a choice of career. It was predicted that the incidence 
would be higher with the ' anxious ' group. The figures are seven (the ' anxious ' 
group are always given first) as against four out of forty-four, not a significant 
result. 

(7) The final item from the essays is the incidence of mention of their own 
parents. The prediction—again on the grounds that security is important to 
the ' anxious ' group—was that the incidence would be higher here. This was 
so, the figures being sixteen and eleven out of forty-four, but the difference does 
not achieve significance. 


(8) With the suggestion in mind that an examination had been of import- 
ance to these ' anxious’ children, reference was sought in the “ Best Moment 
of my Life ” essay, to examinations of any kind, in the expectation that the 
incidence would be higher for the ‘anxious’ group. In fact, the figures were 
only six and five out of forty-two. 


(9) This same short essay had previously been scored for themes concerned 
with ‘ relief from anxiety,"? which would include, for example, recovery from 
illness or the return of a parent after a separation. It was expected that such 
themes would occur more frequently in the ‘ anxious’ group. This was so, but 
the proportions were too small for statistical test, being seven and two cases 
out of forty-two. 


(10) This and the next item relate not directly to anxiety but to obsession- 
ality, and its concomitants. To maintain a goal rigidly in a level of aspiration 
test might well be regarded as obsessional behaviour, and Eysenck's dysthymic 
group does contain obsessives as well as anxiety cases. From the questionnaires 
described above (C), two items were extracted, the items which, in the whole 
investigation, seemed to have the closest reference to obsessive-compulsive 
tendencies. On the assumption that the ' anxious’ group is in fact anxious 
rather than compulsive, the prediction was made that it would ot score higher 
than the control group on these items. i 

Item 10 refers to the statement, “ I should be able to keep clean and tidy 
at work.” As described above, a weighted score is extracted for these items 
each child can score from 0 to 3 according to the degree of importance attached 
to it. The weighted scores total 56 and 59 for the two groups, supporting the 
prediction. 

(11) Here the statement is “ I should be able to finish one job before I 
have to think about the next.” The weighted scores of the ‘ anxious ' group total 
forty-one, of the control group fifty-nine. On a Chi Square test, comparing 
the distributions of scores, from 0 to 3, the tendency of the ‘ anxious ' group to 
score lower is confirmed at the 0-5 per cent. level. 3 

(12) Prior to the present investigation, an assessment of ambitionsness 
had been made on the basis of four questions, asking for expectations of t s 
first full time job, the job after two or three years, after nine or ten as s, m i 
just before retirement. Signs of upward social mobility were the a medi 
ambitiousness. The prediction made here, suggested chiefly by Ser. c E C 
discrepancy score of the ‘ anxious ' group, is that more of them would hav = 
scored as ambitious. In fact, fourteen out of forty-four of them ur x 
against ten of the controls. The difference is in the predicted direction, but no 


significant. 
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imi ssment had been made of ambitiousness, but 
tel n eedem Berge Son what the child wanted to own at different 
iode m Hide the results were in the contrary direction—nine versus 
oo t ot forty-four. This is a difference low in statistical significance, 
epe on en suggests that the prediction was wrong: that to express 
baie Fd im through the accumulation of possessions, as distinct from 
coe eese demands a more expansive outlook than can be expected of 
he anxious. 


remaining items come from the interview. The first question 
To NS child to falls dhe his common daydreams. _ It was "M P ia ca 
that anxious children would be of more stunted imagination an Nope 
unable to answer this question. The difference found (ten versus eig! 
forty) is not significant. 


i sed: “ find that when you 
15) A further question asked : On the whole, do you 1th 1 
- E very much you usually get it? " followed by me ually 
makes this ‘ come off '? " and “ What is usually the reason if it coen -e : 
off’? ” A count was made of the references to personal earnings and saving: 


(16) One question asked the reasons for people ' doing well, Offering a 
It was predicted that more ‘ anxious : 
children would choose ‘ hard work,’ but, in fact, this was a common stereotype 
and the incidence was high in both Broups—thirty-one and thirty out of forty. 
(17) The last item refers to the question “ If you could have any job at all 
in the world and didn't have to worry about money or anything else, what 
would you choose? " Asin item 14, it was predicted that more of the m 
group would be unable to answer. The difference was not significant, th 
frequencies being seven and five out of forty. i 
"Finally, the total of signs shown by each child was ey ee 
quartiles for items | and 2 and omitting items 10, 11, 13, and 16 for D. did 
that will be obvious from the discussion of these items. In only m af che 
the control score higher than the “anxious ' child, and the applica achin ihe 
f test for paired observations to the total data yields a ż closely approaching 
l per cent. level of significance. 


V.—Concrustons, : 
That the examination of single signs of anxiety has yielded little of 
ng in view of the following two points : 
(a) The‘ anxious’ group here was not composed solely of extreme scorers ; 
Tepancy score almost down to the median was accepted 
to make a group large enough for further study. 
(b The material analysed was not Originally intended for the purpose of 


; incidental Signs, only, are the object of the search. 
Nevertheless, taken together, the differenc 


ing life, concerned to fing and retain security and 2 
ight arouse anxiety. This, perhaps, points the way to 
Finally, it is not possible, in 


the present study, to say anything further about school streaming and its 


relationship to personality. 
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A NOTE ON THE RESPONSES MADE BY SECONDARY 
SCHOOL CHILDREN IN THEIR DEFINITIONS OF WORDS 


By DONALD G. BURNS 
(Education Department, University of Leeds) 


Summary. Investigations into the nature of children's definitions commonly 
imply some classification based on the formal characteristics of the definition 


impossible to group definitions with confidence in categories of this kind and 
that any attempt to assess their value should be based on criteria of a psycho- 
logical kind. 

Definition by emotional tone appears to be one of the most rudimentary 
types of definition and though such definitions have frequently very little social 


value, they evidently occupy an important place in a psychological scale of 
values. 


Definition that associates meaning with a specific situation often appears 
superior to definition by emotional tone because it suggests a more socialised 
and mature concept: but within this category, many levels may be noted, 
ranging from the proto-class to the purely emotional. 

Definitions that are Beneric in type, seem to reveal greater social maturity 
and would appear to be superior logically and psychologically to definitions in 
terms of specific contexts or emotional tone. Four kinds of definition of this 
type are distinguished. 

Various levels of definition are to be found within each of these main 
categories and it is suggested that the level of definition and its category should 
be considered separate criteria in any scale of semantic values. In general, the 
category indicates the extent to Which the definition may be logically adequate 
and the level, the extent to which it reflects maturity of understanding. A 
good definition is satisfactory by either Criterion but any definition that is less 
complete than this, should presumably, be judged by a criterion of conceptual 


as well as logical adequacy and in many situations (e.g., psychological tests), the 
former may be more useful than the latter, 


I.—Intropucrion. 

THE general characteristics of definition are that it should state the class to 
which an object belongs and the features that distinguish it from other objects 
in the class. Children’s definitions often fail to satisfy these criteria, not merely 
because their knowledge is limited, but apparently, because their concepts are 
immature; and if this is So, there seems to be a strong case for assessing 
definition-answers by criteria of a Psychological as well as a logical kind. The 
following note records some of the Psychological considerations which were 
taken into account in marking children's definitions from this point of view, 
and also some of the difficulties that seem to be implicit in any gnomic classifica- 
tion of definitions, during an enquiry that has been completed recently in Leeds. 

A large group of senior pupils in the secondary modern schools of the city 
was asked to give written definitions of a number of words as part of a study of 
vocabulary achievement in the English language. Some 4,150 words were 


ge secondary modern schools, 


=o : year of study. "These pupils appeared (from 
empirical enquiries made in the schools about their worn ele er to be 


representative of all children attending the Secondary modern schools of the 
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The I.Qs. of children in the group ranged from 70 to 115 (according to a 
Moray House test administered at 10 plus) and as the enquiry covered virtually 
all children in secondary modern schools at that time (1955), the results may be 
assumed to be fairly representative of all children at that stage of education in 
the secondary modern schools of the county borough. Further details of the 
method of enquiry and the basis on which words were selected have been given 
in Occasional Paper No. | (to be published by the National Foundation for 
Educational Research). Each pupil was asked to give written definitions of a 
proportion of words in the list which were presented in a brief context—the 
simplest way of showing them in use—and the definitions were then given a 
mark on a three-point scale (1, 3 or 0). 


II.—FonRMs OF DEFINITIONS USED BY CHILDREN. 


Many investigations refer to characteristic differences in the quality of 
response between older and younger children when they are defining simple 
words. According to Terman and Merrill, “ an adult gives a response which 
classifies a common object, but also isolates it by excluding other objects which 
do not belong to the class; both a six-year-old and a sixteen-year-old know the 
meaning of orange— It is round and has seeds’ or ' it is a citrus fruit,’ but their 
thinking is characteristically different" (Terman and Merrill, p. 303). Piaget 
claims that the thinking of the young child is based on intuition and personal 
schemas of analogy and that his interpretation of experience, verbal or other- 
wise, is characteristically egocentric; “ childish egocentrism seems to us 
considerable only up to about 7 or 8, the age at which the habits of social 
thought are beginning to be formed "' (Piaget, p. 127). 

Burt, too, found that definitions by description or in terms of the class to 
which a thing belongs are characteristic of older children (Burt, 1947, p. 42) ; 
and Feifel and Lorge confirmed this conclusion in an analysis of the responses of 
some 900 children (aged 6-14) to the Stanford-Binet vocabulary test. These 
investigators found that younger children responded more frequently with 
explanations in terms of use and description and tended to interpret words in a 
concrete sense whereas older children responded more frequently with synonyms 
and tended to stress the abstract features of word-meanings ; the commonest 
of all forms of definition among children aged 9 or over in this enquiry was 
definition by synonym. Baranyai, in a recent article in this Journal, quotes 
further evidence on the verbal comprehension of children (aged 8-10) as shown 
in oral definition, which is interpreted to show a progressive development in 
children’s thinking from the concrete to the abstract. 

The evidence from all these sources suggests that a close parallel may be 
drawn between children’s physical age and their ways of thinking or interpreting 
words, and one would accordingly expect the definitions of a large and represent- 
tive group of children such as that examined in Leeds to provide further evidence 
of these characteristic ways, of interpretation. A scrutiny of the large number 
of definitions received in this enquiry showed, however, that children at 13-14 
years of age use many different forms of definition to explain the meaning of a 
word, depending apparently on their knowledge of the word and their under- 
standing of what was required of them. Words they did not know well were 
commonly defined in simpler and less ‘ mature ' ways (i.e., in terms of use or 
relation to other things) than words that were obviously more familiar, and it 
was difficult to discern evidence of a preference for any one form of definition 
over others. ] i 

The following examples, grouped according to the logical categories suggested 
by Robinson (Definition, 1951), have been quoted to show the variety of forms 
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that children’s definitions may take at this age, and incidentally, the difficulties 
that may arise in distinguishing between them except in the most empirical 
way. . 

ý Definition by synonym.—Many children sought to define a word by using a 
synonym, according to Robinson a form of definition that is most successful in 
defining names of organic kinds. Definitions of this kind have obvious limita- 
tions which would seem to bear out the common doctrine that ‘ there are no 
synonyms’ : 

They cross the abyss canyon 

The car has no accessories spare parts 

A pair of antennae horns of insects 
He serves in the artillery gun army 


He received a bonus an extra wage during the year. 
Covered with bristlles hard hairs 


He fell on the bunk bed on ships 
News of the catastrophe terrible happenings 


A group of cavaliers Royalist supporters 
A carbon deposit soot 


The dance was cancelled called off 
Made of chalk kind of white rock 


Definition by analysis.—Others sought to define a word by analysing it : 


Her accession to the throne 
An outstanding accomplishment 
Saved from bankruptcy 


coming to the throne 

Something to be proud of 

when you've to pay a lot of money and 

; you haven’t got it. 

He advertised the sale put a notice telling people to come 

An alluring picture draws your eyes , 

He opened the bureau building from which you find out things 

Water in the carburetter it holds the petrol as it comes from the 
tank 

anyone who sees a film before it goes round 

held in a place without supplies. 


Approved by the censor 
Besieged for months 


Definition by synthesis. —Some children attempted to define a word by 
stating its relation with other objects that are commonly associated with it, 


a method which seemed successful provided the synthesis was close and 
unambiguous : 


They gathered acorns from an oak-tree 
He fell on the bunk it was two-tiered 
Suffering from concussion where you lose your memory 


Definition by implication. —Others implied that the concept was already 
familiar to the reader and gave examples of things to which it might refer. 
The logical inadequacy of definitions of this type is evident and inevitably 
prompts the question “ Under what conditions can a partial statement be 


assumed to reveal an adequate knowledge of the concept ? ” 
Interested in agriculture 


crops a; imals 
Infected with bacteria Erreur sors 
Here is your baggage trunks and cases 
Infested with bugs insects and flies 
Blamed for carelessness for spilling something 
Catering for hundreds making tea, etc. 
A crop of cereals : 


el . crop of wheat, barley and corn 
A superstitious belief she would not walk under a ladder 


B. 
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The denotative method.—In some answers, the definition showed the word in 
use in a different context. This is clearly an alternative form of definition by 
implication : 


Where is your accomplice ? the thief and his accomplice got away 

A sharp angle thirty degrees is a sharp angle 

An annual event it was the annual fete-day on Saturday 
She accepted the bouquet bouquet of flowers 

He has been bitten bitten by a sake or a dog 


The ostensive method—One other method which was observed in some 
answers seems to depend more or less directly on the learner's experience of the 
example. This also appears to be another form of definition by implication : 


They reacted favourably. if someone stuck a pin in you, you would 
react 

Within my recollection when you remember 

A number of restrictions things you must not do 

Accused of selfishness all for yourself 

A difficult target something you aim at 


The difficulty of distinguishing between these different forms of definition 
is evident. Some of those classed above as ‘definitions by synonym,’ for 
example (because they are attempts to define by description or in terms of the 
class to which a thing belongs), might equally be considered examples of 
definition by implication. Is soot a synonym for carbon or an example of it ? 
And are antennae exemplified or paraphrased by the definition ‘horns of insects’? 
Similarly, the form of definition offered for cereal (a crop of wheat, barley and 
corn) and here classed as definition by implication, might be judged an example 
of definition by synonym. In such cases—and there were many of them—it 
was impossible to group definitions with confidence in any of the logical categories 
mentioned above, not did it seem that any particular significance could be 
assigned to the phrasing of a definition, whether this was in the form of a 
synonym, description or statement of the class to which a thing belonged. The 
only alternative in the present enquiry was to attempt to frame more detailed 
criteria of a psychological kind and the possibility of formulating these is 


considered in the next section. 


III.—STAGES IN THE DEVELOPMENT OF A MATURED DEFINITION. 


Most psychologists would agree that the young child tends to apprehend 
the world as dynamically related to and dependent on himself and Allport 
writes that “ between the primitive and the child and his environment, there 
exists a raw or syncretic relation illustrated by fusions between dreams or 
imagery and perceptions of outer reality . . . (and) it is only in the most highly 
developed individuals that one finds a sharpening of the polarity between 
subject and world " (Werner, 1958). Werner, in developing this thesis, has 

roposed that certain types of mental organisation or structure are character- 
istic of particular levels of development and may be deduced from parallels 
observed in the form and structure of the thinking, conceptualising and perceiving 

{ primitives, children and psychotics. This too, is essentially the view of 
Pi E though many would disagree with his conclusion that egocentric and 
Bnet modes of thinking are characteristic of a particular stage of childhood. 
pw it seems probable, as Vernon suggests (Vernon, 1955), that such modes 
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of thinking represent stages in the development of each class of ideas which may 
occur at any age. But the general conclusion seems clear : the child's concepts 
develop gradually and are continuously being modified as a result of experience 
so that what were originally ‘ raw syncretic relations ' become progressively 
more informed and more sharply determined. Defining words is a function of 
this process of socialisation just as much as other forms of mental behaviour 
(e.g., making logical deductions), and an assessment of the value of a definition 
is, in effect, an assessment of the extent to which the child has assimilated the 
social implications of the word. 


There are marked differences, however, in the extent to which a word can 
be defined with accuracy. Some have a meaning which can be communicated, 
and therefore assessed, with some precision because the senses in which they are 
used are closely prescribed by custom ; examples of these are the large number 
of substantives and verbs which can be defined ostensively like boot, door, walk, 
and vide. Others have developed meanings which are frequently less precise 
because the reference is largely determined by social contacts, and definitions 
of these cannot be assessed with the same confidence ; examples are amenities, 
dingy and elegant. And there are others for which the field of reference may be 
almost wholly determined by personal experience (beautiful, the establishment, 
image) and interpretation then becomes a matter of individual judgment. 
It follows that children may develop a satisfactory knowledge of some words 
much more quickly than others and that their familiarity with more complex 
concepts may have to be assessed by some predetermined standard of definition. 
An arbitrary standard of this kind seems necessary, for example, in assessing 
definitions of abstract words and words used metaphorically. 


One of the least sophisticated modes of definition and possibly one of the 
most rudimentary, is definition by emotional tone. Definition of this kind is 
readily understandable in early life since, at that stage, all the child’s articulate 
sounds are the expressions of feelings or conational attitudes. Meumann and 


Some emotional tone which is felt to be common to them before they learn to 
Scriminate attributes more exactly. One of Valentine’s children, for example, 
appeared to use the word hot as meaning either gencral unpleasantness or both 
very hot and very cold (Valentine, page 414). O'Shea quotes the case of an older 
child (aged 4) who began to use the Word imagination in the sense of any 
keen catlty in objects, a d ) 
e eers used it in his presence (cf the hrase “ ery lively imagination” 
(O'Shea, p. 138) . Werner, de fade it ee test Pee tad pe primi- 
tive ways of thinking in the interpretation of meaning in emotional terms 
(“ physiognomic perception ") cites many examples of similar usage among 
3 y be described as ‘ proud’ when it 

Tecarious angle ; just as a figure with 


different individuals may be suffici si 


basic dimensions of meaning, the other dimensions being potency and activity 


(Osgood, Suci and Tannenbaum). Definition by emotional tone is evidently 
not a form of definition used only by young children (though it is typically 
egocentric in Piaget’s sense of the term) and many examples of definition of this 
kind were found among the answers of children aged 13 and 14 in Leeds. 
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An aristocrat by birth comedian 
He abused her kindness scorned 
An advantageous position good 
An austere man wicked 
He is a bolshevik untidy person 
He became Lord-Chancellor someone clever 
A chronic disease very bad 
A well-known communist powerful 


Definition at this level seems to be a pre-cognitive response, and identifica- 
tion apparently occurs between any elements that share the same emotional 
context. This might explain many of the instances of definition by antonym 
which occurred in children's replies in Leeds. A confusion between, e.g., dusky 
and colourless, may be due to association on an emotional rather than a cognitive 
level and is certainly not unlike the confusion of Valentine's child between hot 


and cold : 


A dusky skin colourless 

This is a harmful drug won't hurt 

In a kilarious mood in a bad mood 

The place is inhabited abandoned: a deserted place 
An immigrant family to go abroad 

Of inferior quality best: better quality 
Iam indebted to you I don't owe you anything 
An irresponsible person take charge of things 

He loathes this food likes 

She was my predecessor Successor 

Living in poverty luxury 

He insisted on punctuality lateness 

She preceded me came after 


A clear and easily recognised step in the development of a more socialised 
concept appears to be revealed when meaning is identified with a specific context : 


I conjure you to give up magic: magician 
A closed vehicle black maria 


Definition which seeks to locate meaning, as here, in terms of some finite 
and cognitive experience, reflects a notion of the concept which is potentially 
superior to definition by emotional tone, however incomplete it may be. 

This also appears to be a form of definition to which children regress when 
nable to apprehend or express wider concepts. Watts observed from 
a survey of the errors of children aged 11 and upwards in a proverbs and idioms 
test, that when they were 1n doubt or ignorant of the meaning of a proverb, 
they selected a sentence which corresponded as closely as possible to the literal 
meaning of the proverb and this is analogous to definition in terms of a specific 
context, which is also an attempt to identify generic meaning with the literal 
one. He notes also that “ In seeking to select what they regarded as the literal 
meaning of a proverb, children seemed to look first for a sentence containing 
some striking word which they remembered from the corresponding proverb k 
(Watts, p. 215). Thus, " One man’s meat is another man’s poison x 
frequently identified by children as meaning the same as “ Beware o eating 
poisoned meat when dining with a stranger." . 

Watt's procedure is very similar to that reported by Pia 
interpretation of the results is quite different. In many cases, 


they are u 


get though his 
the children 
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> ing of proverbs by Piaget at the age of 9, 10 or 1 1, did 
guest on The mewing oi pem " though they evidently thought they did 
den they asked for no supplementary explanation of their literal or hidden 
meaning (Piaget, p. 129) : Matt connected the proverb so often goes ihe jug 
to the water that in the end it breaks ” with the sentence As we grow older, 
we grow better "and when asked why these two sentences meant the same find 
said “ Because the bigger you grow, the better you are and you n d ; 
(i.e., the jug is compared to a child simply because both grow older). ec 
considers this to be evidence of the syncretistic reasoning which is characteristic 
of all children at this age ; and Watts interprets it as an association between two 
fundaments which have some superficial or literal identity. TM 
It seems probable that the degree of syncretism revealed by i 
responses in situations of this kind depends very much on the comp exi yo 
the material. The more concrete examples offered by Watts, for exemple; may 
have facilitated the identification of the proverb with a literal psp Bonn 
and this is a procedure which seems to be analogous to definition in terms of a 
specific instance. > 
p A number of the definitions given by children in Leeds seem to exemplify 
this regression from a broader concept (which they were unable to explain) to a 
narrower definition based on some specific experience. The definition an 
apparently framed to express a generic concept of the word but was qualifie 
or explained by the addition of details which make it analogous to other forms 
of definition with a specific context : 
Along the embankment 
A feeling of guilt 
They paid homage 
He received a pension 
He became premier 
Classified as a quadruped 
To safeguard our health 
It contains a Supplement 
Serving as a trooper 


place where trains run 

feeling of dishonesty . 

tributes for the last time 

reward for something you have done 
first prime minister 

has three legs 

Somebody who guards people e 
something free—a knitting patte 
man who bails from plane 


All definitions that are generic in kind suggest a more fully e 
understanding of the concept than definitions in terms of a specific CONUS 
various forms of generic definition were noted in children’s iy adt 

One of these was the plain definition of the word by Staung n [eA 
(Orange is a citrus fruit). The following are examples of densifons of t ES a 
all of which are evidently incomplete because they fail to state the dlistimguis UR 
marks by which a concept differs from other members of the same class : 

This will not kindle 


burn 
The curtains are ornamental for show 
A murderous blow he killed him with a blow " 
He found the parchment Paper with some important thing on 
Living like a peasant poor man 
The origin of the quarrel Cause 
He speculated on the matter look into 
He was summoned yesterday found guilty 
Transparent materia] invisible 
Wielding a sword Swinging 


In other definitions, the class c 
made explicit. This may be beca 
knowledge in words or simply bec: 


oncept may seem to be implied but is not 
use children were unable to express their 
ause they felt they were only expected to 
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show they could identify it in an approximate way. The fact that children’s 
awareness of meaning may outrun their ability to express it has been noted 
elsewhere. O'Shea writes, “ I asked H., at seven to define loveliness. I can 
tell from her response when the term is used, in conversation or reading, that she 
understands it in its fundamental reference at any rate, and she can use it 
quite effectively ; but she cannot state formally just what meaning it has for 
her. “ I know but I cannot tell,” she says . . . if I press H. for some statement 
about loveliness, she will fall back upon the simpler term ovely—“ when a thing 
is lovely, it has loveliness.” (O'Shea, p. 146). Older children must face the same 
problem and may, as Werner and Kaplan have suggested, attempt to bridge the 
gap between the general and the specific by using phrases such as ' a kind of,’ 
"like a, or ‘sort of.’ Such formulae seem to argue an awareness of the need for 
a generic definition, and to this extent, may be held to reveal a more mature 
concept than any form of specific definition but they can hardly be considered 


the equivalent of a class concept. : 


A compartment to himself kind of small room 
They ran the blockade kind of barricade 
A closed vehicle a car or something 
She entered the convent kind of school 
Shaped like a cone like an ice-cream 


Other definitions that were phrased in this way, included some mention 
of the characteristic features of the concept : 


The boar charged like a pig with tusks 
She wore a coronet jewelled hat like a crown 


A fourth form of generic definition—and one that was generally most 
satisfactory from a logical point of view—was that which stated the class as 
well as the distinguishing features of the concept : 
cake made with nuts called almonds 
for ever questioning 
made love for fun 
examined carefully 
gallop gently 
drank a little at a time 


An almond cake 

An inquisitive person 
She often flirted — 

He scanned the horizon 
He cantered into the yard 


She sipped her tea 


There were many words, however, that children of 13 and 14 could hardly 
be expected to define adequately per genus et differentiam because they lacked 
the maturity or specialised experience of the adult and here, some less exacting 
form of definition must probably be considered adequate. Burt has noted that 
few children are able to give more than an approximate definition of words like 
kindness and justice before the age or 12 of 13 (Burt, 1955, p. 352), and 
younger children tend to define such complex concepts, as might be expected, 
in terms of specific instances. “ Punctuality means coming to school ata stated 
hour, discipline means being ready to obey definite rules on given occasions, 
truth means saying precisely what you have seen and heard and done and so p 
(Watts, p.26). Definitions at this level are, perhaps, evidence of a napina a 
of the words in these situations rather than of any inability to deal with a S n 
concepts as such ; and the apparent inability of older children to giv ee = 
definitions of more complex concepts can. be similarly explained. E. E oed 
acquaintance with a word is limited to two or three contexts, whic «eee d 
small basis for determining genus and species. In such cases, any 
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that states or implies the generic relationship and exemplifies it in a particular 
way, would seem to be satisfactory, despite its logical limitations : 


The accusation is unjust you accuse somebody when you say— 
you've done it 
The city corporation people to whom we pay rates and in 


return, they clean the streets 
He showed great friendliness kindness to help 


IV.—THE INTERPRETATION OF WRITTEN DEFINITIONS, 


In a review of written definitions, there must be many aspects of conceptual 
development that escape analysis owing to the form in which the definitions 
were given. It is difficult for example, to discriminate with confidence between 
varying levels of the wide range of emotive definitions when these are given in 
written form. Other aspects of conceptual development such as the ' plural 
concept '—the genetic predecessor of the universal concept, according to Stern— 
have been omitted from the discussion because they seemed to be of marginal 
interest and could only be adequately considered in a much longer paper. The 
limitations of any study of conceptual development based on written definitions 
are indeed self-evident and the chief interest of the present survey may lie in 


the evidence it provides of concepts at virtually every level of development 
among a group of older children. 


According to this evidence, it would seem that the most rudimentary type 
of definition assumes the form of an equation between meaning and emotional 
tone (e.g., aggressive is bad-tempered, austere is wicked). All definitions that 
have an emotive rather than a cognitive basis have been grouped in this general 
category but the value of the specific definition will evidently depend on its 
social relevance. Some (e.g., definition by antonym) may be judged to be more 
relevant than others (e.g., the syncretistic interpretations quoted by Piaget) and 
a few may have passed into current use (e.g., The coarseness of the cloth... 
thickness: A chaste life . . . hard, stern), Whatever the social use of such 


definitions, they clearly have an important place in a psychological scale of 
values. 


Definition that associates meaning with a specific context appears to be 
markedly superior to definition by emotional tone because of the evidence it 
should offer of a more socialised and mature concept ; but here again, many 
levels of definition were noted ranging from the proto-class concept to the 
purely emotive. Some suggest a broader concept and imply a regression to a 
narrower interpretation by quoting some specific experience (e.g., Pension— 
reward for something you have done), some are practically valueless because 
the specific context is incongruous (e.g., the young boy’s use of imagination) 
and others that are based on a superficial resemblance between fundaments or 
some form of syncretistic reasoning, are psychologically indistinguishable from 
the emotive definition. 

Definitions that are generic in type, seem to reveal greater socia] maturity 
and are logically and psychologically superior to definitions in terms of a single 
context or an emotional tone. Several kinds of definition of this type were 

observed : 


(a) Definition stating the class concept. 
(b) Definition implying the class concept. 
(c) Definition stating class and significant characteristics. 
(d) Definition implying class and stating characteristics. 


mr 
sy 
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A number of definitions that stated the class concept seemed to be satisfactory, 
no doubt because the synthesis between word and class-concept was close (i.e., 
they were virtually synonymous) ; befell—happened, needless—unnecessary, 
slaughtcr—killing. The difference between these and definitions that implied the 
class concept by using phrases of the kind ‘sort of,’ ‘like a,’ ‘a kind of,’ seems 
to be only one of degree. 

For many concepts however, definition by class and characteristic is the 
only satisfactory form of definition (Cantered . . . galloped gently, sipped... 
drank a little at a time, scanned . . . examined carefully). Logically adequate 
definitions of this type for complex concepts can hardly be expected from children 
of 13 or 14 ; and it seems reasonable to consider any definition satisfactory that 
relates the class concept to a specific instance, assuming of course, that both 
concept and instance are appropriate (Corporation . . . people to whom we pay 
rates and in return they clean the streets). 

Varying levels of definition may accordingly be found within each of the 
main categories postulated by this analysis and it would seem useful to consider 
the level and the category (as here defined) as separate criteria in any scale of 
semantic values. In general, the category indicates the extent to which the 
definition may be logically adequate and the level, the extent to which it 
reflects maturity of understanding ; and a whole range of definition may be 
offered for any word by either of these criteria. A good definition is satisfactory 
by either criterion because it defines per genus et differentiam and by the same 
token, reveals a mature concept; but any definition that is less complete than 
this must presumably be judged by a criterion of conceptual maturity as well as 


one of logical sufficiency. 
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THE PERFORMANCE IN ENGLISH LANGUAGE, AT ‘O’ 
LEVEL, OF A SAMPLE OF UNIVERSITY STUDENTS 


By E. A. HEWITT 
(Durham Colleges Education Department, University of Durham.) 


Summary. Results in the 1953 and 1954 Cambridge examinations in English 
Language, at ‘ O’ level, have been obtained for 1,423 boys and 882 girls, from 
197 schools, who were subsequently admitted to universities. Significant 
differences are shown in the grades awarded to boys and girls, and to arts and 
Science students, of comparable age. Linguists achieved significantly better 
marks than non-linguists, but there is no significant difference between the 
non-linguists among the arts men and the men who specialised in the ‘ pure ' 
sciences. 
The failure rates for boys taking the paper for the first time were 1 in 9 for 
scientists, 1 in 15 for arts students. For girls the corresponding rates were 
1 in 42 and 1 in 27, The male scientists who had won open awards to universities 
were not inferior in English, at ' O ' level, to arts students in general, though 
they were inferior to the arts men who won open awards. 
Among the arts men, those who went to Oxford, Cambridge or London 
gained significantly better marks than those who entered other universities ; 
a similar difference is found in respect of the men scientists, and of the women 
arts students. 
I.—INTRODUCTION. 
Ir is generally accepted that the written English of university science students 
is inferior to that of arts students, and the inferiority appears usually to be 
attributed to specialisation at university and in the last years at school. The 
chief remedy so far employed has been the provision of more time for scientists 
to pursue English studies in the VIth form, though some university dons are 
now giving their scientists linguistic training.* It has seemed to the present 
writer that the written work of some arts students and of more scientists con- 
tains faults that one would hope not to find in the work of the average grammar 
school pupil at ' O ' level, and that, before specialisation begins, the prospective 
arts student writes better English than the pupil who later specialises in one 
of the sciences. In general, however, headmasters and teachers with whom the 
writer has discussed the matter have expressed the view that at ' O’ level the 
prospective scientist is as good at English as the future arts student. 


II.—Score or THE ENQUIRY. 

Results in the Cambridge Syndicate’s ‘O?’ level examination in English 
Language have been obtained for 1,423 boys and for 882 girls who took the 
examination for the first time in 1953 or 1954, and who have since been admitted 
to universities—or, in the case of some of the men who did their national service 
first, have been offered, and have accepted, places for 1959. The writer is 
indebted to the Council of the Durham Colleges, who financed the distribution 
of a questionnaire to more than 400 schools ; to the headmasters and head- 
mistresses who replied to it ; and to the Cambridge Syndicate, whose officials 
were most helpful at every stage of the enquiry. The questionnaire asked for 
candidates’ names, examination result and date, together with candidates’ ages, 
the university to which they were admitted, and whether they followed a degree 
course in arts or in science. It was suggested that such further details as 
scholarship awards, subjects in which candidates were specialising at university, 
and whether they had taken the ‘O’ level Language paper ‘more than once, 
would be of interest. Naturally, in view of the time required for looking up 


* See, for example, Times Ed. Supp., Oct. 17th, 1958: Towards the broader view. 
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records, schools with large numbers of university entrants tended to be more 


selective in their replies to the questions. 
given ; in others, specialist subjects, or scholarship: 


In some cases pupils’ ages were not 
s, were not mentioned ; the 


H fl H n r . . 
universities to which eight boys had gone were not named. These omissions 


restrict, sometimes severely, the 


the Appendix. 


The official Cambridge list for 1958, show: 


size of the sub-samples in the tables given in 


ing 526 centres in England where 


candidates sat for the Syndicate’s papers, was used in distributing the question- 


naire. 
certain Cambridge papers, 


were new schools, or, for various reasons, 


Some schools and institutions 
but not for the English Language paper. 
had only begun to enter pupils for the 


appearing in this list entered pupils for 


Others 


Language paper after the relevant years 1953 and 1954, and the list did not, of 


course, include any centres w. 


Cambridge after the latter year. ] 
because of the type of school or institution, or because 0: 


hich may have transferred their alle; 
No approach was made to 112 centres where, 
f the small numbers on 


giance from 


the roll, it seemed clear that very few pupils could have proceeded to universities. 


Table 1 gives the numbers 


ments in the 1958 list, of those which 


and types of schools and other educational establish- 
answered the questionnaire, of those which 


did not, and of those which were not approached. Replies were received from 


73 per cent. of the schools and ins 


NUMBERS AND TYP 


Institutions providing details 
of ex-pupils at universities 


Institutions rendering nil 
returns seest ttt 


Institutions unable to fill in 
questionnaire pear 


Institutions not replying to 
EQUITY)” aes gener 
s to which the 


stitution: 
D as not addressed 


enquiry W: 


TABLE 1 


Maintained 
Grammar 


Direct ent 
Grant Schools 
* 


Schools 


and Independ- 


Sec. Mod. 
and 
Sec. Tech. 
Schools 
and other 
Institu- 
tions 


titutions to which the enquiry was addressed. 


ES or EDUCATIONAL INsTITUTIONS ENTERING CANDIDATES FOR THE 
CAMBRIDGE ' O' LEVEL PAPER IN ENGLISH LANGUAGE. 


Totals 


Boys 51 
Girls 54 
Mixed 27 


Boys 
Girls 
Mixed 


Boys 
Girls 


Mixed 


Boys 
Girls 
Mixed 


Boys 
Girls 
Mixed 


* Includes 


Boys 
Girls 
Mixed 


* voluntary cont 


Types 


rolled ' and ' voluntary aided.' 
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The boys in the sample would be roughly equivalent numerically to a 
5 per cent. sample of English boys entering universities in the years 1955 and 
1956, and would account for perhaps a similar percentage of the boys who won 
State Scholarships. The girls constitute a 6 per cent.—7 per cent. sample, and 
include rather more than 7 per cent. of the girls who won State Scholarships. 


The chief defect in both samples is probably that London and the North of 
England are very thinly represented. 


IIL—Tuz pata. 

During the period 1953-1954, the Cambridge paper in English Language 
retained the same form. An essay on one of nine topics, and a precis, were 
compulsory ; the remaining questions, from which two had to be selected, dealt 
with such matters as word and phrase equivalents in given contexts, punctua- 
tion, direct and indirect Speech, and the re-writing of a faulty prose passage. 
There were no questions dealing with formal grammar or clause analysis. The 
original marks awarded are no longer available, for the results were published 
in terms of nine grades, grade five being the lowest passing grade. The mark 
intervals corresponding to the grades are known, but the grades do not cover 
equal ranges of marks, and the highest and lowest grades are, in effect, open- 
ended, so that the means for various groupings of pupils cannot be calculated, 
though medians can be. While the grades are designed to be comparable from 
year to year, the range of marks within any one grade is not necessarily the 
same from one year to the next, so that medians should be calculated for each 
year separately. In practice, these difficulties have been of little consequence, 
for the distributions of candidates among the different grades are of more 
interest than means or medians, The distributions for various groups of pupils 


Ae shown inthe Ay pendix, and diff ze been calculated 
in terms of xt. P erences between groups have been 


() c IV.—Rzsurrs, en 
. UM). Comparison between arts and Scie nts.—The terms ‘arts a 
Science are used here to refer to groups Sil wo not to university faculties. 
Law, and Economics, for instance, are classified under ‘arts’; ‘science 
includes Mathematics, Engineering, Medicine Dentistry, Agriculture and so on. 
n a few cases, classification presented difficult but the border-line decisions 
have not e HE affected results. y À 
n each year, and for both boys and gi e median of the prospective 
2 reson 1s roughly three marks pure ee. running in theory from 
rid en though the Tange of marks between grades 1 and 9 is only 35) than that 
id HE ne ong eae The English grades for boys are shown in the first contingency 
Ti a oe Appendix, There are Proportionately fewer scientists in the upper 
g q S a Clore nthe lower. The ifference between the distributions for arts 
end ee students is highly significant (x? with 5 degrees of freedom=65, 
= 001). | Among the girls, too, there i. "t marked difference between the 
gradings of arts and science students (x? with 5 4 f.=38, P=-001). Calculations 
for each age group Separately provide no evidence ra suggest that the differences 


due to scientist i t0? t 
E musts taking the “O level Paper earlier, on average, than arts 


The boy scientists who won State Scho 

achieved better English results than other scient d —28 

i cientist: 2 with 1 d.f. » 

P= :001) ; they did less well than the Scholarship a eel pe the arts men 

(x? with 1 d.f.—9-7, P= 01), but there is no Significant difference between their 
results and those of arts men in general, 


larships or other open awards 
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(b) Comparison between linguists and mon-linguists.—Men reading for 

degrees in languages were significantly better in English at ' O ' level than men 
studying other arts subjects (y? with 2d.f.—28-6, P=-001). There is a tend- 
ency for those pursuing * pure ' science to get better English results than other 
men grouped under the general heading ‘science’ (y? with 1 d.f.—3-86, 
P=-05). But there is little to choose between men specialising in other arts 
subjects than languages, and those taking ‘ pure’ science; x? with 1 d.f. is 
-026, a value that would be equalled or exceeded in 80 per cent. of random 
samples. 
(c) The sex difference.—The medians for girls who later specialised in arts 
and science respectively are roughly three marks higher than those for the boys. 
The girls’ sample contains a higher proportion of pupils from independent and 
direct grant schools than the boys’ sample does, but there is nothing to indicate 
that this is responsible for the apparent sex difference. Line 6 in the Appendix 
suggests that 16-year old girls from the maintained schools do as well in English 
as those from independent and direct grant schools (y? with 4 d.f.=1-0, a 
value that would be obtained in 90 per cent. of random samples), and the 
difference in respect of the 15-year-olds does not reach the 5 per cent. level of 
significance. 

Among both arts and science specialists, the differences between the distri- 
butions for boys and girls are significant beyond the 0-1 per cent. level, and 
separate calculations for 15-year-olds and for 16-year-olds suggest that the 
apparent superiority of the girls is not to be accounted for by any tendency for 
boys to take the examination at an earlier age than is the case with girls. | 

(d) Distribution of students among the universities —One might anticipate 
that Oxford, Cambridge, and London would between them take a disproportion- 
ate number of the arts students who are most highly gifted verbally, and in 
fact, the distributions for these three universities on the one hand, and for the 
remaining British universities on the other, are significantly different, at the 
0-1 per cent. level for boys (x? with 5 d.f.—27) and at the 1 per cent. level for 
girls (x? with 4 d.f.=14). The three senior universities also take a dispropor- 
tionately large number of the verbally-able boy scientists—as judged by ' O 
level English results ; the difference between the distributions for Oxford, 
Cambridge, and London together, and for the other universities, 1s significant 
at the 0-1 per cent. level (x? with 5 d.f.=46). When the male scientists at the 
three senior universities are compared with male arts students elsewhere, the 
difference between the distributions is no greater than would be found in 50 

(2 with 2 d.£.—0-9). The failure rate in the 


er cent. of random samples a 
p a proceeding to Oxford, Cambridge and London 


Language paper for all boys i eee 
together is 1 in 14; for those who went to other universities it 1s lin 9. 


(e) Candidates with poor results in English at ' O ' level.—Proportionately 
more boys than girls in the sample failed in their first attempt on the Language 
aper—as is the case not only with all Cambridge candidates, but with all ‘O’ 
level entrants in England.* For the two-year sample of university entrants, 
the failure rates for boys are 1 in 15 and 1 in 9 for arts and science students, 
respectively, and the corresponding rates for girls are 1 in 42 and 1 in 27. Not 
all the first-time failures, of course, are to be regarded as poor material from the 
linguistic standpoint; one girl initially in grade eight eventually took a H (2) 
honours degree in English, and one boy scientist who failed in his first attempt 
was placed in grade one six monthslater. Nevertheless, in view of the fact that 
roughly 60 per cent. of all boys and 70 per cent. of all girls taking the Cambridge 
* See e.g, Education in 1953: HLMS.O., Cmd. 9185; and Education in 1954: 


H.M.S.O., Cmd. 9521. 
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paper in any one year pass the examination, failure in students of university 
calibre is no light matter. 132 boys (9-3 per cent. of the sample) failed at the 
first attempt, and in 125 cases the final passing grade is known. One-third of 
the scientists and one-fifth of the arts students could not achieve better than the 
lowest passing grade when they were eventually successful. At least twenty 
boys required three or more attempts at the paper, and one scientist —now 
expected to take a good Ordinary degree—needed seven. All the girls who 
failed at the first attempt succeeded at the second. 


V.— DISCUSSION AND CONCLUSIONS. 

The sex difference shown in this enquiry may be due at least in part to the 
earlier maturing of girls, but it seems unlikely that the English of women 
university students can give rise to as much concern as that of the men, and no 
further reference to the girls’ ‘ O ' level performance will be made here. 

The interpretation of the remaining findings must be governed by such 
considerations as the representativeness of the sample, the validity of the ‘O’ 
level Language paper, and the consistency of standard in the written English 
of individual students. 

(a) Sampling.—Some of the factors which, in conjunction with sampling 
errors, might affect the apparent difference in ability in English as between 
arts and science specialists can be illustrated from the records of two schools 
each of which, in the course of two years, sent some thirty boys to universities. 
School A, in a south-coast resort, contributed roughly one university entrant 
for every fifteen boys on the school roll, and one in two won a State Scholarship 
or other open award ; two out of every three were placed in grades | or 2 in 
English Language, and only one boy—a foreigner—failed at the first attempt. 
School B, serving a rural area and producing a higher proportion of scientists 
than School A, contributed roughly one university entrant for every six boys 
on the roll, one in four gaining State Scholarships ; only one in ten was placed 
in grades | or 2 in English Language, and one in four failed at the first attempt. 
School A entered only four under-age candidates for the paper, and they were 
all placed in grade 1 ; more than half the candidates from School B were under- 
age, one of them, a 14-year-old, requiring two further attempts before he passed. 

Such differences between schools render futile any attempt at general 
prescription, and emphasise the need for a wide sampling of schools in an 
investigation of the kind described in this paper. In spite of the variety of 
conditions, however, the tendency for arts specialists to achieve better results 
than scientists in the language paper is remarkably regular from school to 
school. Out of the twenty-five schools which each sent twenty or more boys to 
universities, the reverse trend is to be found in one, and in three schools there 
is no perceptible trend ; in the remaining twenty-one, the arts students achieve 
the better results, and in each of three schools, in spite of the relatively small 
numbers of pupils concerned, the difference between arts and science specialists 
is significant at the 5 per cent. level. 

It is not, of course, inconceivable that a particular examination in English 
should penalise scientists who, had they entered for the paper set by another 
examining body, might have done as well as the arts specialists. “The only 

concrete evidence concerning the comparability of papers set by different 
examining bodies is that the failure rates in the Cambridge paper closely resemble 
those quoted by the Ministry for the papers of all nine examining bodies 
together. There seem to be but slender grounds for supposing that use of the 
Cambridge paper for the investigation reported here gives an exaggerated 
impression of the poor standard of the English of some prospective university 


entrants. 
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Attention has already been drawn to the fact that some of the samples are 
very small. Inferences from the figures given in lines 3 and 4 in the Appendix, 
for instance, cannot be made with the same confidence as inferences based on the 
first two lines. 

(D) The validity of the examination.—Some evidence of validity has already 
been presented—those who specialised in languages at university level tended 
to get higher ‘ O 'level marks. English Language papers at ‘O’ level may not 
be precise measuring instruments, but lack of consistency in markers, candid- 
ates and papers cannot create the distinctions revealed in Section IV above. 
At the worst, one would expect the paper to provide some evidence of candidates’ 
ability to write clear English, using accepted conventions; to test width of 
vocabulary, grasp of sentence structure, and power to put thoughts in an 
appropriate order. If the essay does not promote any high degree of involve- 
ment, at least most candidates will have had much practice in the skills required 
for success. 60 per cent. of all boys entered for this examination pass, and it is 
improbable that the standard of English demanded for university entrance is 
any higher than that of the average 16-year-old grammar school boy. There was 
a failure rate of 8 per cent. among those boys in the sample who were reported 
to be 16 years of age when they first took the paper, and though some may have 
been merely unfortunate, it is difficult to believe that the English of the majority 
of these unsuccessful candidates could be creditable in any circumstances. 

(c) Consistency of individual performance in English.—The available 
evidence suggests that a pupil's relative standing in English is reasonably stable 
over the first five years of secondary schooling. The correlation between 11-4- 
quotients and S.C. or G.C.E. ‘O’ level English seems to be in the region of 0-5,* 
uncorrected for the effects of selection, and so comparable with the correlation 
between a typical 11+ battery and a general academic criterion five years later. 
One may at least assume that those who are good at English early on tend to be 
good later, and that defects at 16 are likely to be associated with earlier short- 
comings. Defects of long standing are not likely to be remedied quickly, and 
one would expect that attention given to them before the VIth form, if it were 
feasible, would be more effective than the provision of time and special courses 


later. ja fain conclusions.—Despite the possibilities of errors in sampling and 
oes in the measuring instrument, which have been conceded above, the 

ems ble evidence points to the following conclusions : ] 

avara ability in English between arts and science students 


sim i ace in rts anc 5 
— a entirely due to the effects of specialisation ; a difference 


sived at least as early as 15. uu l 
ini AUT fte oantet between the English of scientists and that of arts men 1s 


less fundamental than that between those reading for a degree in languages and 
dá ther subjects. . . . 
Mass takib y i better scientists (judging by scholarship awards) are 


~ On the whole the i hol vards 
€ s an the other scientists, and are not significantly inferior to 


in universities is n 


better at English th 
arts students in general. 


* The following have reported, in this Journal, correlations of 0-47, 0-48, 0-486, and 
DE. Lew Armstrong, H.G. Symposium— The use of essays in selection at 114- : 

5 "li —The predictive power of the English composition. 26, 163-171. — 5i 
Peel, E. A. and Rutter, D. The predictive value of the entrance examination. , 


30-35., PENE aS. 
ji S. Symposium : III— Reliability and validity. 26, 172-179. 
iet d i. ead Wilmut, F.S. Prediction of S.C. performance in specii 
2 


fic subjects. 
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4.—The poor English of some university students does not in itself con- 
stitute a strong case against the standard of English teaching in the grammar 
schools. University selectors are admitting candidates who collectively present, 
at the age of 16 or thereabouts, a range of ability in English which is comparable 
with that of the whole population of grammar school boys of that age. The 
universities might improve the situation by taking‘ O ' level English results into 
consideration, at least when weighing the claims of border-line candidates. 
Complaints about the poor standard of English expression of some university 
students are inevitable when, as with the present sample, over 9 per cent. of the 
men failed in their first attempt on the ' O ' level English Language paper, and 
8 per cent. of the 16-year-olds similarly failed to reach the standard achieved 
by 60 per cent. of all boys entering for the examination. 

5.—The verbally-able boys—as judged by results in the English Language 
paper—whether arts or science students, tend to go to one of the three senior 
universities. The provincial universities would probably be most affected if the 
standard of English required for university entrance were raised, or if, as the 
result of university expansion, the standard of English among university 
students in general declined. 


VIL—APPENDIX. 
University STUDENTS' GRADES IN ENGLISH LANGUAGE AT ' O' LEVEL. 


Categories of Pupils Percentage of pupils awarded the grades | No. of 
indicated. pupils. 
1 2 3 4 5 6-9 
(failed) 


(1) Boys reading arts . 
Boys reading science 


. 5 130 6-7 555 
5-1 12-1 299-1 22:8 199 10-9 868 


(2) Girls reading arts — .......... 18.2 27.8 33.2 12:2 62 2:5 582 
Girls reading science ........ 77 Bain 21 22-7 1L3 37 300 
" — OD p 
(3) Boys with State Schols. f arts 82-5 17:5 102 
or other open awards science 64-5 35.5 169 
— = — ey 
(4) Boys—languages —........... 52-9 PA 15-7 102 
Boys—other arts subjects 21.5 50-0 28:5 172 
A ——— ee 
(5) Boys—pure S iisas ponin 48-7 51:3 226 
Boys—applied science — ......, 39.7 60-3 229 
3 
6) 16-yr.-old arts girls J Maint. Scls.| 14-8 30.8 35-5 13-0 5-9 169 
(8) 16-7 EU*TInd.& D.G. | 14.7 33.7 31.2 10-3 7-1 184 
(7) Boys (arts) at London, Oxford 19-7 5 99.6 3 10-9 2 
and Cambridge — .......... BAS: gd Del a ids 
Boys (arts) at other universities | 7.5 18-7 98.9 21:3 153 93 268 
T ———— 
(8) Girls (arts) at London, Oxford | 21-4 31.4 32.6 9-2 54 261 
and Cambridge ......... — d = 
Girls (arts) at other universities | 15-6 24-9 33-6 14-6 11-2 321 
(9) Boys (science) at London, Oxford 
and Cambridge — .......... 9-0 16-8 27.3 19-0 jg. 
Boys (science) at other universit- 88 90 399 
18 63542 9:230 RARO suu ERA 15 82 30-6 26. 20-9 19.7 464 


(Manuscript received, March 23rd, 1959.) 
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A TYPE OF BIAS IN MARKING EXAMINATION SCRIPTS 


By M. J. FARRELL anv N. GILBERT 


Summary. We consider the hypothesis that, in some circumstances, the 
variance of the marks awarded by an examiner will increase with the number of 
scripts he has marked. We test this hypothesis on the data of fifty-four 
different class-lists, and find a small but highly significant effect of the sort 
predicted. We consider various possible alternative explanations of the 
observed effect, and conclude that there is fairly strong evidence of the existence 


of such a bias. 


J.— INTRODUCTION. 


IN this note we report some calculations made to test a hypothesis formed by 
one of us as to a likely type of bias in the marking of examination scripts. The 
hypothesis is that, in examinations which involve an element of judgment on the 
part of the examiner, the variance of the marks he awards will tend to increase 
with the number of scripts he has already marked. That is to say, the more 
scripts (of the relevant paper) he has marked, the more ready will he be to award 
extreme marks. We felt, on the basis of experience, that such an effect would be 
likely to arise through growing confidence on the part of the examiner. At first, 
even after a preliminary ‘reading in,’ the number of answers with which a 
given answer could be compared would be relatively small, and the examiner 
correspondingly cautious in the mark he gave; but as the number of answers 
available for comparison grew, so would the examiner's confidence. Of course, 
most examiners would do their best, by various devices, to counteract such a 
tendency but even so, some residual bias might persist. A similar effect might 
be produced by fatigue, but this would be more likely to be important in 
examinations like the General Certificate, where very large numbers of scripts 
have to be marked. In university examinations, we would expect the ' con- 
fidence ' effect to be the important one. In any case, our test is for an increase 
in variance, however it may have been caused. If scripts are marked in alpha- 
betical order this would imply that extreme marks were more frequent in the 
later than in the earlier part of the alphabet. The same effect would appear, 
though less markedly, even if the scripts are not always marked in alphabetical 
order sn long as the earlier part of the alphabet tends, on the average, to be 
marked earlier. 


IL—METHOD AND RESULTS. 


ri e of such an effect in the class lists of the Cambridge 

Tri lis M ve list is published of the successful candidates in each 
c P ohio the list being sub-divided according as the candidates obtained 
first second or third class honours, or passed at a lower level. No list is pub- 
lished of those who failed outright, and this is a definite defect in the data from 
oint of view; but, fortunately, such cases are extremely rare, passing at a 

pur p itself considered quite a serious failure. (Those who do not 


] being 1 3 
ene dass honours are often sent down by their colleges.) We, therefore, 


ini ‘i i t a lower 
number of those obtaining first class honours, or passing à |] 
ei eek third class honours as a measure of the number of extreme marks 
awarded. 
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Viewed purely as a psychological experiment, it would be much more 
efficient to use the actual marks awarded, as they contain a great deal more 
information than the class lists. We had no choice, as the marks themselves 
are not published, but in the present context there are two ways in which the 
class lists are actually preferable. First, many examiners would argue that the 
final classifications are considerably more accurate than the marks awarded, 
so that a test based on the former is fairer to the examiners. Secondly, since 
only the class lists are published, the bias will only matter in practice in so far 
as it is reflected in the class lists. 

By classifying each successful candidate according to whether his class is 
extreme or central, and according to whether the initial of his surname comes 
before L or after K in the alphabet, we can summarise the information in each 
class list in a two-by-two table like that of Table 1. From each such table we 
calculate the statistic x, where 


a+b+ce+d 
x= (ad —be) V re EHE TS 


I M of x for each of the seventy-two class lists considered are given in 
able 2. 


TABLE 1 


A to K L toZ 


Class i 


Extreme 


Central 


The total number of candidates on each class list was usually between 100 and 
200, except in Economics Part II (about 70) and Natural Sciences (300 to 500). 
In the case of the Economics Tripos, the frequency of extreme classes was too 
low for the x? approximation, on which our main significance test depends, to 
hold, so we included third class honours in the ‘extreme’ group. This is 
legitimate since the definition of ‘extreme’ is in any case arbitrary; the 
proportion of seconds and thirds is much greater in Economics than in other 
Triposes ; and so far as one can see by inspection, it does not seriously alter 
the general picture. 

The problem of combining the information from these seventy-two two-by- 
two tables so as to obtain an efficient test of our hypothesis is an interesting one. 
The frequencies cannot, of course, be added together to yield one large 
two-by-two table, since the probability of extreme classes certainly varies 
between faculties and may well do so over time. The most commonly used 
method is to observe that, since each x? is distributed approximately like x? 
with one degree of freedom, if we can assume the x’s to be independent, then 
Sx? is distributed like x? with seventy-two degrees of freedom. This gives a 
test, but an exceedingly inefficient one: for it merely considers the size of the 
deviations of x from its expected zero value, and ignores their sign. It is thus 
very insensitive to small, persistent deviations of x all in the same direction, 
which is the sort of effect we have here. è 


: 
! 


49 


M. J. FARRELL and N. GILBERT 


9S%-L2-] $Z0-S— 


L81-€— 0680+ 8IS:S— fPZPZ— £809— 0L81— S28: 0— 


SL S— 


POL: G— 


GCG E— 


260:0— 
I8e9-0--| 6eb-T-- 


926:1--| 687-0— 
€S8:S—| 990:0— 


886: 0- 
LL8:8—| oS 


Terg 
S8leL—| IZt:0— 


nem TEOL 


96F-0— $€60:0— Ekk 0— L$9-0-c- PC£-I— 9LI-0— 
BOS-O+ S8S-I+ I0£-0— 260-0— 998-0— SZI-0+ OF9-1— F6I:0— 


959-0— G68L:1+ FEOF Ler- 0+ 
160:0— 6L} 0+ €69-0— Z8S-0— 9£8-0— ES90— ZFI-O+ I9€:1— 


II Hed 
THe 


sodir] SIUS [e1njeNr 


II Hed 
Id 


sodir sor eureqep 


L61:0— 6£0:0— 069-0— SEI-0+ €68-1— Z220-0-- 20£-0— 828:0— 
996:0-- 991-S— I1€6-0-- $S9-1— 625 1— ZIZ-O+ 88£-0— 609:0-- 


LLUI— €£90:0— Z6FO— II8-0-- $89-1— F66:0— v09-I-- S6L-0+ 
8£0.:5— 00L-1— ELS0— ZPG-1— Fb9-0-- vS7-1— L66-1— gerot 


II Hea 
I Hed 


***** Sod sorurouoog 


I Hed 
Ieg IM 


uoneuruexg Áreurarmodq 
sodig 9ouetog peoruRyooTy 


YS6I €S6I 


$61 TS6I OSGI 6v61 SP6I 


‘SHTAV], AONSONILNO) ZXZ MONA GSADIS([ SGSLVIAS([ TVWOION| LNATVAINOA 


z ATAVL 


50 A Type of Bias in Marking Examination Scripts 


Just how inefficient the ‘adding x?’ test can be is illustrated in the 
present case by comparing it with a simple combinatorial test. Xx3— 83.62, and 
the chances of obtaining a value at least as large by chance are better than 1 in 7. 
However, if the deviations were purely random, the number of positive values 
of x in a sample of seventy-two would be binomially distributed with p—1, 
n—72, and the chance that as few as twenty-five of the x's should be positive 
would be less than 1 in 100. Thus, the rough-and-ready combinatorial method, 
which ignores the size of the deviations, is yet much more sensitive than the 
“adding x?’ method. 


The method actually used, due to Yates (1955), successfully takes account 
of both sign and magnitude. It turns on the fact that if x? is distributed like x? 
with one degree of freedom, then x is normally distributed with zero mean and 
unit variance. The values of x in Table 2 do, in fact, show a normal distribution 
with unit variance, although the mean is not zero. Now in the absence of bias, 
Ex should be normal with zero mean and variance 72, so that Ex(— —27-256) 
shows a deviation from its expected value of more than three times its standard 
deviation. In fact, the probability of its chance occurrence is about 1 in 1,000. 
The analysis of variance in Table 3 shows that there is no significant hetero- 
geneity—the same bias seems to persist in all the examinations and in all the 


years considered. The observed residual mean square may be compared with 
its theoretical value of unity. 


TABLE 3 


Analysis of Variance 


_ The probabilities quoted are those appropriate to testing for a change in 
variance, using a ‘ two-tailed’ significance test. Now, in fact, our hypothesis 
was that there would be an increase in variance ; it discounted the possibility 
of a reduction in variance, Consequently we should, strictly, apply a * one- 
tailed ’ significance test, obtaining a chance for the observed increase in variance 
of lin 2,000. The question is in any case of purely academic importance, since 
a probability of 1 in 1,000 is small enough to convince most people. Most 
human minds seem to jump from doubt to certainty at a significance level 
somewhere between 1 in 50 and 1 in 500. 

. Our tests assume that the values of X are all mutually independent. This 
is not strictly true, as many students take, in different years, both the examina- 


ions quoted for their faculty. , However, the overlap is very incomplete, and in 
practice the x's may be considered independent. The correlations (based on 
five or six degrees of freedom) 


follows : between the appropriate values of x are as 
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The possibility must be considered that our results reflect, not an increase 
in the variance but a shift in the mean of the marks awarded. When the number 
of firsts exceeds the number of ' passes at a lower level,’ a general increase in the 
marks awarded to L-Z candidates (as opposed to the A-K's) will result in a 
corresponding increase in the extreme classes at the expense of the seconds 
and thirds. That is, a shift in the average severity of marking will give an 
apparent change in the variance, simply because of an inequality in the 
frequencies of the two extreme classes. 

It is possible to test this by constructing another sort of two-by-two table 
for each class list, classifying classes as ‘ good’ (first or second) or ‘ bad.” The 
statistic y can be calculated from each such table analogously to x, and from 
the values of y the hypothesis of a mean shift can be tested. A little care is 
needed. The most favourable hypothesis (although rather an unlikely one @ 
priori) is that those faculties where the upper extreme is more common than the 
lower display a rise in the mean, and the others a fall. Reversing the sign of 
y (wherever necessary) in accordance with this hypothesis, the most effective 
test of this rival hypothesis is to correlate x and y. As x and y have the same 
sign in only thirty-seven of the seventy-two cases, the possibility can be 
rejected. (These arguments hold so long as the distribution of marks is 
unimodal and moderately symmetrical). 


III.—Discussiox. 


We may conclude that there is definite evidence that the variance of the 
marks awarded to the L to Z group is greater than that for the A to K's, but in 
the absence of any control experiment, it is impossible to rule out the possibility 
that this is a property of the candidates rather than the marking. That is, 
it is logically possible that those whose surnames come late in the alphabet 
shew a greater variation in aptitude for the subjects in question. 

One way in which such a situation might conceivably arise is that some 
particular national or racial group might (a) have a substantially different 
distribution of surnames between the two halves of the alphabet, and (b) a 
substantially different dispersion ın the aptitudes of its candidates, as compared 
with the rest of the candidates. If both these differences were large enough, if 
the national or racial group in question formed a sufficiently large proportion of 
all candidates, and if the differences were in the right direction, this could 


produce the observed difference. 
For example, if candidates from overseas contained a larger proportion with 
ee d the latter part of the alphabet than candidates from the United 
surnam ewed a greater variation in aptitude, then this would 


> and if they sh : 4 
Kingdom, and if they d observed. But, bearing in mind the relatively 


ect of the kind « e relat 
Lu ed iier f overseas candidates, and the (at least) moderate similarity of 


Bienne ames between the parts of the alphabet, the dispersion 
cn distribution inm to be sounds ta fr to account for even the small 
s we have observed. 

It is obvious that a large number of such hypotheses can be constructed— 
! lves more or less plausible, but unlikely to be of sufficient magnitude 
in themse he observed effect. It would be very difficult, without much more 
to explain than is available, to test any one such theory ; and, it would, of 
information ossible to test exhaustively all the conceivable hypotheses of this 
coe atr must decide for himself how much weight to allow to such 

ype. 


possibilities. 
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It is perhaps of interest to see how large the bias is. As we have said, the 
simple adding together of frequencies from different class lists is not strictly 
defensible ; but it may serve to give an impression of the rough order of mag- 
nitude of the effect. In all years, 20-1 per cent. of the A to K group obtained 
extreme classes in Part I of the Mechanical Science Tripos, compared with 
20-8 per cent. of the L to Z group. This is indeed a small bias, but its very 
smallness serves to emphasize how persistent it must be to achieve such high 
levels of statistical significance. 


The bias that is so small is, of course, that which shows itself in the class 
lists—the important quantity from a practical viewpoint. But from a purely 
psychological point of view the bias in the original marking is more interesting, 
and this may well be much larger. For the proportion of the latter that shews 
in the class lists will be reduced by any attempts by the examiners to avoid 
bias, for example, by re-reading, and in particular by any departure from 
alphabetical order. Thus, if a fraction f( —1) of the examiners read their scripts 
in counter-alphabetical order the bias is reduced by (approximately) a factor 
1-2f. (This suggests what is probably the simplest practical remedy—to have 
half the examiners read the scripts in counter-alphabetical order and the other 
half in alphabetical order. The bias in the aggregate mark is then reduced to a 
second order quantity depending on differential bias between examiners of 
papers and differential performance between papers.) It should also be noted 
that the subjects chosen are all more or less scientific ones, where the element of 
judgment in marking is relatively small. In more literary subjects one might 


expect a larger bias, although this might be offset by the examiners’ greater 
awareness of its possibility. 
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ZERO ERROR IN MORAY HOUSE VERBAL REASONING 
TESTS 


By A. E. G. PILLINER 
(Department of Education, University of Edinburgh) 


J. SUTHERLAND 
(Moray House College of Education, Edinburgh) 


AND 


E. G. TAYLOR 
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Summary. Non-equivalence in the performance of the samples of children 
used in standardising the several members of a series of similar tests results in 
lack of comparability between the standardised scores obtained with different 
tests in the series ; all the standardised scores from one test may be positively 
or negatively biassed relative to those from another. The bias is named ' zero 
error’. , 

The zero errors of twenty Moray House verbal reasoning tests, all relative 
to one taken as standard, are estimated and shown to be larger for more 
recent than for earlier tests. The increase is ascribed mainly to increased test 
sophistication in the later standardisation samples. 

Local education authorities should take zero errors into account in their 
allocation procedures. 


I.—INTRODUCTION. 


Tue purpose of this article is to draw attention to a particular source of error 
which is likely to be present to some degree in all standardised scores or 
“quotients ' obtained when one or other of a series of standardised ‘ objective’ 
tests, all bearing the same label, is administered. Examples of such series are 
the Moray House verbal reasoning tests or the National Foundation English 
attainment tests, and there are many others. The experimental part of this 
study is concerned only with the Moray House verbal reasoning tests, but there 
is little doubt that parallel studies with other series of tests would produce 


similar results.* : 
Users of tests such as these are apt to assume rather lightly that the 


i r whatever) performance level of two children is much the 
hes ueri ds the same standardised score, the first child on test A in the 
series, the second on test B. Few nowadays would interpret this common 
standardised score as indicating exactly the same performance level for these 
children, since the concept of ‘ standard error of measurement ’ is a familiar one 
(see, for example, Guilford, 1950, p. 479). Many, however, are insufficiently 
3 ibility of a bias attaching to all the standardised scores obtained 


f the possi ; i 
Prane he bests relative to those obtained on the other ; that children scoring, 
say, 80,100, and 120 on one test in the series might have scored, 84, 105, and 123, 


had a different test been chosen. 


nication, Mr. D. A. Pidgeon, Senior Tests Officer of the National 
arch, tells us that the zero error studied here with Moray 
vable with National Foundation tests also. 


* In a personal commu 
Foundation for Educational Rese 
House verbal reasoning tests is obser 
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The occurrence of a bias in standardised scores obtained from one test 
relative to those from another in the same series may be misleading ; for 
example, when a local authority uses one of a series of tests for the ‘ main’ 
examination and another for the ‘ supplementary ’ examination for absentees ; 
or when an allocation procedure involves a cut at some predetermined standard- 
ised score, regardless of the test used. Such procedures, too rigidly applied, 
are to be deprecated. They imply a faith in the consistency of calibration from 
test to test, which those who construct and standardise them are unable to 
share. 

In the experimental investigation to be described, the mean standardisa- 
tion bias of each of a number of Moray House verbal reasoning tests, relative to 
one chosen arbitrarily as standard, is evaluated. Since each bias reported is 
based on an experiment in which the subjects were a random sample from a 
larger group with a mean standard score of approximately 100 which corresponds 
to zero on the standard score scale; and since the bias under discussion is 
similar in some respects to the systematic errors sometimes occurring in physical 
measurement; we propose to call this standardisation mean bias ‘zero’ 
error. 


IIL—ORIGIN OF ZERO ERROR. 


The standardisation of a test initially involves its administration to a 
large sample of children (the nature of this sample will be discussed later), of 
the appropriate age range. This sample, called the ‘ standardisation ' sample, 
is sub-divided by month of age, and within each month group the children's 
original or ‘ raw ' scores are used to obtain their percentile ranks. These ranks 
are then transformed to a ‘normal’ distribution of standardised scores or 
‘ quotients,’ usually with a mean of 100 and a standard deviation of 15. After 
repeating this procedure for all month groups in the sample a conversion table 
or table of norms is prepared which relates the raw scores and ages of the children 
in this sample to the corresponding standardised scores. Similar conversion 
tables are prepared for each test in the series, using each timea different standard- 
isation sample. Each table of norms thus prepared makes it possible to match 
the performance level of any child subsequently taking the corresponding test 
with that of some children in the standardisation sample. For example, if 
entering the table with some testee's age and raw score assigns him a standard- 
ised score of 100, his performance level on the test is equal to that of all the 
children in the standardisation sample whose several raw scores place them at 
the 50th percentile in their several month groups (Thomson 1932, Lawley 1950). 

Suppose that on some particular occasion, a test was chosen, that it was 
used with a number of children, and that the mean of their standardised scores 
was 100. Now suppose that a different test in the series had been chosen and 
used on this one occasion with the same children and that their mean standard- 
ised score on this different test was 106. Note that, in fact, only one test has 
been administered and only one mean is available: either 100 or 106; and 
that the test user accepts whichever turns up in ignorance that the other might 
have done so. It is not inconceivable that some important decision may hinge 
on the result. 

The obstacles in the way of achieving the ideal that any child tested should 
obtain the same standardised score, whichever test in the series happens to be 
the one administered to him are : 

(1) Differences in content from test to test, resulting in slightly different 
rank orders for the same children. 

(2) Non-equivalence in the standardisation samples. 


———— | ee ee 
M ^ 
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With regard to (1), the imperfect correlation results in unbiassed errors of 
measurement which can be estimated with some precision if the correlation is 
known. It is worth pointing out that it is, in fact, unknown ; the ordinary test- 
retest correlation based on the administration of different tests to the same 
testees on different occasions under-estimates the correlation which should be 
employed, namely, that between the children's scores on the test used and the 
scores they would have obtained with a different test om this one occasion. 
An under-estimate of the correlation means an over-estimate of the standard 
error of measurement. In estimating the standard error of measurement we 
m err, therefore, on the safe side by basing it on the usual test-retest correla- 

ion. 
With regard to (2), by ‘ equivalence ’ would be meant that we can legiti- 
mately assume each standardisation sample to be representative of the same 
specifiable ‘ population ' of children ; a population against which it makes sense 
to measure the prospective testees in test performance. If obtainable, equiva- 
lent samples would be inter-changeable in respect of the tests, without affecting 
the tables of norms. Testees would be pitted virtually against the same popula- 
tion, whatever the test, and any discrepancies in their standardised scores if 
one test were used rather than another would be due entirely to differences in 
test content, as discussed in (1). With equivalent samples as here defined, there 
would be no systematic bias associated with all the standardised scores obtained 
with a particular test. ' Non-equivalence' would imply either that the 
standardisation samples are not representative samples from the same popula- 
tion or that they are samples from different populations. In either case the 

ossibility would exist of systematic as well as unsystematic errors occurring. 

The population to be represented by the standardisation samples consists 
of a large number of local authority year-groups, the mean standardised scores 


and the allocation procedures of which show remarkable diversity. A number 
d and so must be 


of these authorities prefer to use tests already fully standardise 
excluded from consideration. In practice, therefore, it is impossible to achieve 
completely representative sampling and some approximate procedure must be 
used, such as the following. The use of newly constructed tests not yet fully 
standardised is restricted to those other authorities which are prepared to 
co-operate by making available the raw scores and ages of the children in year- 
groups working these new tests; 1n return these authorities receive specially 
constructed tables of norms based on the data supplied. From among the 
authorities in this sub-population which have chosen à particular new test a 
sample is selected which, on the basis of previous experience and current 
performance, is judged to be as nearly representative as possible of the general 
population of authorities. Finally, a ‘national’ conversion table for the new 
test is prepared from the pooled data of the authorities thus selected from the 
sub-population. 
In spite of these difficulties, the zero error of any one Moray House verbal 
reasoning test relative to its neighbour in the list, is usually remarkably small ; 
both, it should be noted, will have been standardised at about the same time. 
However, local authorities using new and old tests with consecutive year- 
oups (which may be expected to be about equal in mean performance), do 
experience a rise in mean standardised score from the new to the old test which 
indicates non-equivalence in the samples used for standardising these more 
widely separated tests. This non-equivalence results less from the selection of 
oor samples from a given population than from a slow change in the perform- 
ance characteristics of the population itself, particularly since the early 1950 s. 
It appears to be associated mainly with the observed wide-spread increase 
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during recent years in ' test-sophistication ' which will be the subject of later 
discussion. 

An experimental study designed specifically to estimate as precisely as 
possible the zero errors of commonly used tests was undertaken for the following 
reasons. First, such a study might contribute information useful in a general 
survey of test standards. Second, the results might be helpful to investigators 
using these tests for the study of population trends, earlier maturation, the 
effects of practice and coaching, and the like, including also the re-evaluation 
of the results of previous studies in these fields in which the zero errors of the 
tests used were not taken into account. Third, the publication of the figures 
obtained might not only contribute to a more general awareness among test 
users of the existence of zero error, but also give those responsible for allocation 
procedures very practical assistance in the interpretation of the results obtained. 


IIL—THE EXPERIMENTAL INVESTIGATION. 


The tests chosen were the verbal reasoning tests M.H.Ts. 33 and 39-58, for 
which ‘ national’ norms had been prepared over the period 1945 to 1957. For 
Six of these tests, namely M.H.Ts. 39-44, data were already available from 
Michael F. Moore's previous study (1955) of the practice effects resulting from 
the administration of six verbal reasoning tests in succession to the same group 
of children. Moore's study, while primarily concerned with practice effect, 
differed from previous studies of this kind in that his experimental design 
enabled him to estimate the zero errors of the tests used and to disentangle 
these errors from the practice effect. Each of six groups of children worked six 
tests in six different orders, the whole conforming to a latin square design, the 
reciprocal nature of which enabled Moore to isolate both practice effect and 
zero error simultaneously.* The only draw-back to Moore's elegant design was 
that the final computations had to be restricted to the scores of six numeric- 
ally equal groups of children all of whom completed allsix tests. Illness during 
the administration period resulted in considerable loss. Profiting by Moore's 
experience, the present writers employed for the remaining tests a method 
which minimised wastage while retaining the advantages of the latin square. 

The simplest way of carrying out the experiment would have been to use a 
number of large independent random samples, that is, equivalent samples. 
Sample 1 would have test A, sample 2 test B, and so on (Lindquist, 1953, p. 47). 
With proper randomisation, the expected mean performance level is the same 
for all samples, so that any observed differences among the standardised score 
means would be ascribable to zero error. This method was rejected because of 
the practical difficulties in the way of efficient randomisation and the lack of 
precision inherent in the design. 

Instead, separate comparisons were carried out with two tests at a time. 
In each comparison, both test A and test B were administered to the same group 
of children using the ' cross-over ' method. The children were divided at random 
into two numerically equal groups. Group I took first test A, then test B : 
Group II took first test B, and then test A. The difference between the overall 
mean score on A in its two positions and the corresponding overall mean for B 
gave the mean zero error, with practice effect eliminated. 

Test A was always M.H.T. 40, chosen as standard. Test B was one of the 
other tests, the experiment being repeated, each time with different children, for 
all the tests except for Moore's six. The variable was standardised score 
(‘ quotient ’) on the ‘national’ norms in the test manuals. 


* The desirability of eliminating the effects of zero error in the test used for practice 
and coaching experiments was previously noted by Vernon (1954, p. 59). 


a HEN 
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The children and the tests were allocated to the groups as follows : Primary 
schools were chosen at random from the Edinburgh and Midlothian lists. Six 
of these schools, again chosen at random, were allotted to each comparison 
AB,, AB, ABs, etc. Within each school the available children were allocated 
at random to two groups of the same size for each school, but not necessarily 
so from one school to another. Finally, within each school the group designa- 
tion I or II, was decided by the toss ofacoin. Group I's testing order was always 


AB and group II's BA. : 
The main advantages of this design are: first, the effect of absences is 
Second, the replications in 


less catastrophic than in the larger latin square. 
des information about the ' interaction ’ 


six schools for each pair of tests provi abc 
between school and zero error, that is, the extent of variation of zero error from 


school to school, in addition to information about the ‘ main’ effect, the mean 
zero error overall. Third, since a random half of the random sample of children 
engaged in the experiment took M.H.T. 40 first, valid inferences can be drawn 
about the performance on this test of the population sampled, and this informa- 
tion is useful in judging the validity of regarding the zero errors obtained in the 


experiment as generally applicable. 
IV.—EXPERIMENTAL RESULTS. 
Table 1 summarises the main results of the enquiry : 
TABLE 1 


Zero ERRORS or Moray HOUSE VERBAL REASONING Tests RELATIVE TO M.H.T. 40. 


1949 1949 1950 1950 1951 1951 1952 


Standardisa- 
tion Date 1945 1948 1948 


142 162 


132 132 132 132 132 132 162 


m 39 40 4h 42 43! 44! 45 46 — 47 
"enn 0-28 —042 -0-14 -0-31 -0-85 -1-09 


Zero Error .. -0:92 0 -0-41 


 Standardisa- 53 1954 1954 1956 1957 1957 


tion Date 
2 210 168 188 
ATHE 50 193 52 4 55 56 57 58 
3g cag 281 576 54:72 -8:70 -5.98 -6-11 -6-14 


N-—number of children in each experiment. 
s are those obtained by him. 
d by Mr. Moore, and the zero errors are t y 
Quee wore Ne ells is gratefully acknowledged. 


Permission to quote 


Mean standard error of zero errors relative to M.H.T. 40— 4-:57. Mean standard 
error or Zero errors derived as in (2) below— -80. 

ive sign means that the mean standardised score on the test 

One sa Sealer by the zero error recorded than that obtained 


indi is expected to be less 1 
= oe pon if M.H.T. 40 had been used instead. 


Zero errors relative to tests other than M.H.T. I 
mn umts Table 1. Thus, the zero error of M.H.T. 58, relativ 


is —6:14—(—9:98)— —0-164-S.E. -80. 


40 may be readily, 
eto M.H.T. 56 
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Table 2 presents a typical example of the fifteen analyses of variance.* 
That in Table 2 results from the comparison of two tests, here M.H.T. 40 (test A) 
and M.H.T. 56 (test B). The analysis is the outcome of a design modified from 
the replicated 2x2 latin square (‘cross-over ') design of Cochran and Cox 
(1950, pp. 112-116). The numbers of children in the six schools concerned were 
32, 40, 40, 40, 36, 30, and 32; within each school the children were allocated to 
groups I and II, in equal numbers. Group I's test order was AB ; group II's, 
BA. 


TABLE 2 


ANALYSIS OF VARIANCE OF STANDARDISED SCORES OBTAINED BY CHILDREN IN SIX SCHOOLS 
on M.H.T. 40 anp M.H.T. 56. 


S.S. 


Schools 18,256-7 
Groups within schools socials 619-2 
Ist vs. 2nd occasion ... A 367-7 
Schoolsxoccasions ..... i 119-9 


Tests end 3,750-1 75-0(1,5)** 
249-8 0 |4-13(5,198)** 
37,5737 


2,397.9 


63,335 -0 


——— d LLL 


The F's recorded indicate that the overall mean zero error is significant 
and that the zero error varies significantly among schools. 

In Table 3 are reported the standardised score means for the six schools 
used for the M.H.T. 40—M.H.T. 56 comparison. Each school mean is based on 
the sum of all scores obtained by the children in the school, half of the scores 
being in AB order, the other half in BA order. Table 3 gives also the mean zero 
error for each of the six schools. The regression of zero error on school mean 
should be noted. 


TABLE 3 


ScuoorL MEANS AND ZERO ERRORS. 


2 3 4 5 6 AII 


40 40 36 30 32 210 


110-23 108-19 108.14 100.28 96-07 92.38 102-99 
—778 —653 747 —6.28 327 4.19 —5-98 


* The scores from M.H.T. 39-44 were analysed by Moore in a single table. 


V.— DISCUSSION AND IMPLICATION OF RESULTS. 


Each of the zero errors in Table 1 is based on a comparison between the 
arbitrary standard M.H.T. 40 and one other test administered to the same sample 
of children. The care taken over sampling procedure makes it reasonable to 


A. E. G. PILLINER, J. SUTHERLAND and E. G. TAYLOR 59 


accept these observed zero errors as estimates of the population values, the 
population consisting of the pool of two year-groups of children of appropriate 
age. A sample of 1,291 children drawn from these two year-groups—those 
who took M.H.T. 40 first—had a mean on this test of 105-5, which we may take 
as a good estimate of the population mean on M.H.T. 40. Had this population 
of two year-groups taken one of the later tests with a zero error of 5-6 points, the 
expected mean would have been close to 100, this expectation tallying with our 
knowledge from other sources of the mean performance levels on later tests of 
these year-groups. The weight of the statistical evidence, namely: the adjusted 
mean of about 100; the observed standard deviation of 13-3; the coefficients 
of skewness (B,— —-029) and kurtosis (8,—2-52) ; all this makes it probable 
that the Table 1 zero errors are applicable to most current year-groups. 


These zero errors are themselves subject to error of measurement. Omitting 
Moore's six tests, there remain fifteen comparisons with M.H.T. 40 in most of 
which the Schools x Tests interactions were significant (the mean square of 
50-0 in Table 2 is rather larger than average). The significance of these interac- 
tions has important consequences for the precision of the zero error estimates, 
since it implies a significant heterogeneity of zero error from school to school in 
each comparison which must be inferred in the population sampled; the 
appropriate errors of measurement must, therefore, be based on the mean square 
for the Schools x Tests interactions. Thus, in the M.H.T. 40—M.H.T. 56 com- 
parison, the estimated zero error of 5-98 has a standard error of /(2x 50/210) — 
-690. The average of all such standard errors is 0-570, which is appropriate for 
zero errors as recorded in Table 1 (i.e., relative to M.H.T. 40). For zero errors 
obtained by taking the difference between two recorded errors (i.e., relative to a 
test other than M.H.T. 40), the mean standard error is -570 /2— -80 approxi- 
mately. 

A study of the regression of zero error on school mean in the fifteen com- 
parisons reveals in some, but not all, a tendency for zero error to increase with 
school mean. The data of Table 3 provide a striking example; the regression 
is highly significant. In other comparisons the evidence is less clear, but the 
general trend is in the same direction. mE the E eni regresia 
are insufficiently large to necessitate adjustment o — a error over the 
range of local authority mean quotients likely to be met wi pP 

Table 2 illustrates a further point of some importance. "s Dac Moan 

; : -es the mean error variance of measurement of 
square in the table (12-1) measures ; This error variance implies a 
children’s individual standardised scores. aa E it “95 endi bin 
correlation between M.H.T. 40 and M.H.T. Sco! ] 


i eviation of 15.* Correlations, similarly estimated, 
Pee MET. add each of the other tests, range from -92 to -96 with 
average -94. This uniformity of correlation is evidence that all the tests will 
place the same set of testees in much the same rank order as M.H.T. 40, and 
hence as each other. For the validities of the tests in the series with respect to a 
common criterion, this uniformity is important. There is no evidence of that 
cumulative error, producing a * drift’ from the criterion, against which Heim 


(1954, p. 109) rightly warns us. x , n 
in Table 1 show the effect on the mean standardised score 

f ie tee eee testees if one test, rather than another, happens to be 

choseñi. For example, the mean on M.H.T. 58 is estimated as 6:14 (+SE -57) 


points lower than that on M.H.T. 40; and, similarly, the mean on M.H.T. 54 


* p= (152— 12-1)/15°= :95. This follows from : error variance=s?(1— r). 
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as 3-63 (+SE -80) points lower than that on M.H.T. 47. These apparent 
differences in ‘ difficulty ' indicate a rise in the performance level of the samples 
available for standardisation from the earlier to thelater tests. The same group 
of testees, pitted against a later instead of an earlier standardisation sample, 
will do relatively less well and so obtain lower standardised scores on the later 
test. There are indications in Table I of a very slight upward trend in the zero 
errors over the first eleven tests listed, and a quite sharp discontinuity between 
that of M.H.T. 48 and that of M.H.T. 49. The upward trend then persists, but 
appears to be levelling out over the last four tests in the list. 


There can be little doubt that this rise in performance level reflects mainly 
the effect of test-sophistication, increasing from earlier to later standardisation 
samples. Children in later samples have become more familiar than their 
predecessors with the type of material included in tests such as these through 
more extensive coaching or practice on similar material. It is significant that 
the discontinuity in zero error between M.H.T. 48 and M.H.T. 49 occurred at a 
time (1953) when many local authorities (including those used in the standardisa- 
tion samples) were introducing full-scale practice tests as a preliminary to 
their allocation procedures; the time of the public controversy over coaching 
and practice (Vernon, 1952). The further upward trend and the flattening 
towards the end of the series suggests that saturation may now have been 
reached, though we cannot be certain of this. 


We can only speculate about other influences contributing to the rise in 
performance level reflected in increasing zero errors. Tanner (1958, p. 1689) 
has pointed out that as a result of better nutrition and perhaps because the world 
is getting warmer, children in Western Europe and America are maturing 
physically at an earlier age than their predecessors; according to him the 
presumption is that there is a similar trend in mental development. If this is so, 
the effect on performance and hence on zero error will reinforce that of 
increasing test-sophistication. 

We turn now to the practical uses to which a knowledge of these zero errors 
can be put. In the first place they should be useful to those responsible for 
testing groups of children when comparisons have to be made between two 
groups of children tested with different tests, A knowledge of the zero error of 
one test relative to the other will enable the tester to adjust one of the observed 
group means so that the comparison between them is unbiassed. The step from 
group means to individual scores is a little hazardous ; this study is concerned 
with biasses in mean scores only, and provides no evidence that the bias is 
constant throughout the range of individual scores ; indeed, we present some 
slight evidence that zero error, like practice effect, is positively correlated with 
score. However, if the step from mean scores to individual scores is made, it 
will be in the right direction even if its magnitude is not entirely correct. Thus, 
in the case of a child who moved from one area where he obtained a standard- 
ised score on M.H.T. 47 of 114 to another where the test used was M.H,T. Shs 
it would be fairer to adjust his standardised score to 119, than to leave it at 114 
in placing him in relation to the children in his new year-group. Again, in the 
case previously mentioned, of a supplementary examination administered to 
absentees from the main examination, it would not be unreasonable to diminish 
the score of each child taking the supplementary examination by 4 points if the 

supplementary test were M.H.T. 50 and the main test M.H.T. 55. 


Zero error may need to be taken into account in interpreting the results of 
research investigations; for example, the study by Wiseman and Wrigley 
(1953) of the effects of coaching and practice based on test booklets containing 


A. E. G. PILLINER, J. SUTHERLAND, and E.G. TAYLOR 6I 


material alleged to be similar to that incorporated in verbal reasoning tests 
They used a number of Moray House tests to evaluate the practice and coaching 
effects at various stages in their experiment, and had reason to suspect, from 
the general pattern of group mean scores, standardisation errors in M.H.T. 39 
and M.H.T. 40. If we adjust (where we have the data to do so) the mean scores 
obtained by Wiseman and Wrigley in the test run for the children who had 


simple practice we obtain the following : 


Initial Practice Tests 


Test 


M.H.T. .. 43 37 38 39 40 41 42 


W. and W’s 
Means....| 99-62 | 104-7 108-88 105-73 109.05 110-57 
* * 


Zero Errors | —0-42 —0-92 —1-41 0-28 


Adjusted 3 
Means....| 100-04 |(104-07) (108-88) 106-65 110-46 110-29 


The adjustments have had the effect of bringing the M.H.T. 39 mean more 
into line with the general trend, and the flattening out from M.H.T. 40 is slightly 
more regular than with the unadjusted array of means. The authors’ main 
conclusions remain unaltered, but it is easy to see how misleading the results 
might have been had tests with widely discrepant zero errors been chosen for 


the experiment. 

Finally, the results of the present study have implications for the problem 
of test standards generally. The upward trend of zero errors is evidence that 
the level of performance on tests of this kind has risen generally and may con- 
tinue to rise. Test standardisations must of necessity be based on the perform- 
ances of contemporary samples of children ; and these, it is argued here, are 
not equivalent to earlier samples. Are those responsible for the standardisations 
to continue to assign to each new test an arbitrary mean standardised score of 
100 in spite of this known non-equivalence ? Or are they to relate the test means 
to that of some previous test, standardised on a sample of children of an earlier 

:ng to that test only—as in this study—an arbitrary mean of 


vintage, assigning fail to take account of and, indeed, to 


? dopt the first course is to 
rca dd un donbied rise in performance level the country over ; to adopt the 


second is to accept the necessity of embarking on an extension of the work 
reported here, to co-relate all present and future tests in this and all other 
series of tests from whatever source —a formidable task. The second course, 
though perhaps preferable, presents diffculties, particularly with regard to 
tests of attainment. Test constructors are continually trying to improve their 
tests, to incorporate in them the results of recent research findings, to increase 
their validity, to make them more liberal in conception and to minimise 
undesirable ' backwash ' effects on school curricula. To relate performance on 
some future English attainment test, say, to that on an early member of the 
series, when the tests have little in common except their name, would seem 
unrealistic. 

The time has come for urgent consideration by psychometrists 


issues such as these. 


generally of 
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DIFFERENCES IN ATTAINMENT BETWEEN PRIMARY- 

SCHOOLS IN MIXED-LANGUAGE AREAS: THEIR 

DEPENDENCE ON INTELLIGENCE AND LINGUISTIC 
BACKGROUND 


By D. G. LEWIS 
(Department of Education, Queen's University, Belfast.) 


Summary. Differences in attainment in English and arithmetic between 
primary schools in mixed-language areas of Wales are studied with respect to 
differences in both intelligence and linguistic background. School attainments 
tended to increase with intelligence, though not regularly; and the school 
differences in intelligence did not account fully for the differences in attainment. 
Generally, the attainment of schools with pupils of a strongly Welsh back- 
ground—i.e., bilingual pupils—was lower than that of other schools, this 
tendency being strong in English and slight in arithmetic. It is concluded that 
a bilingual environment may be regarded as an important factor with respect 
to primary-school differences of attainment in English which cannot “be 
accounted for by intelligence. E 


I.—INTRODUCTION. 


Tuis investigation is based on the results of standardized tests of English, 
arithmetic and (non-verbal) intelligence which were given to all the 10-year-old 
pupils from sixteen primary schools in the counties of Cardiganshire, Carmar- 
thenshire, Merionethshire, and Pembrokeshire.* Linguistic background was 
assessed by means of a language questionnaire, and test scores from a total of 
375 pupils were obtained. The schools difiered considerably with respect to 
the linguistic background of their pupils, some drawing most of their pupils 
from Welsh-speaking homes, others having only a small minority of such pupils. 
None of the schools were ' W elsh ' schools, where the medium of instruction is 
Welsh, and all the pupils could understand and speak English with reasonable 
ffuency Pupils with a strongly-Welsh background are thus bilingual, while 

; 1 of limited * Welshness ' are monoglot. The aim of this 


se with a backgrounc L : 
pect i is s": see how any differences m attainment between the schools 
vestige E v any 
ding difference 


relate to correspon s in intelligence, and, in particular, to see 
whether the attainment differences are im excess of what might be expected 
ts differences in intelligence. If this is so, then changes m the linguistic 
roi = e oi the pupils (ie. the varying degrees of ‘Welshness ’ in their 
luem gne eRT may appear as an important factor. Certainly, this 
possibility should be considered before attributing any unexplained attainment 
differences entirely to the teaching factor. 


II.—DESCRIPTION OF TESTS AND QUESTIONNAIRE. 


ish ai "i lic Tests. 

.—The English and Arithmetic : : 
^ Two tests of attainment In English were given: | 

(i) Schonell’s Silent Reading Test B (Test 4), which provides a measure ot 
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attainment in the comprehension of English. It consists of twenty items, of 
which the testee is required to select, from a number of alternatives, words 
which make the best sense when put in blank spaces in a short passage of 
prose. The test has a time limit of 15 minutes. It will be referred to as English 
Comprehension. 

(ii) Moray House English Test 21 (M.H .E.21), which provides a measure 
of attainment in the general usage of English. It is a test designed especially 
for grammar-school selection and, during recent years, has been in private use 
amongst education authorities for this purpose. It consists of 120 items 
(synonyms, spelling, punctuation, vocabulary, idioms, etc.) and has a time limit 
of 40 minutes. It will be referred to as English Usage. 

Two tests of attainment in arithmetic were given : 

G) Schonell's Essential Mechanical Arithmetic Test (Form A), which pro- 
vides a measure of attainment in the basic arithmetical processes. 

(ii) Schonell’s Essential Problem Arithmetic Test (Form A), which provides 
a measure of attainment in the understanding and solving of written arith- 
metical problems. 

Both these tests consist of fifty items and have a time limit of 30 minutes. 
They will be referred to as Mechanical Arithmetic and Problem Arithmetic 
respectively.* 


B.—The Intelligence Test. 

Jenkins’ Scale of Non-verbal Ability (N.F.E.R., 1947) was used as a 
measure of intelligence. Previous investigations have suggested that verbal 
tests of intelligence are not suitable for measuring the intelligence of bilingual 
children—or at any rate for assessing their capacities relative to those of 
monoglot English-speaking children (see, for example, Saer (1922 and 1923), 
Smith (1923), Barke (1933), Barke and Parry-Williams (1938), and Jones (1952)). 
Jenkins’ test is non-verbal and diagrammatic in character, the testees having to 
select the correct figure for each item. It consists of eighty-five items, arranged 
in five sub-tests separately timed, the total time required being 30 minutes. 
An alternative version of the test, in which all instructions were given in Welsh, 
was available for all children who preferred to use it. 


C.—The Questionnaire. 

A language questionnaire constructed along lines similar to Hoffmann's 
Bilingual Schedule (Hoffmann, 1934) was administered to assess the (English- 
Welsh) linguistic background of the children tested. The questionnaire had 
previously been used in Caernarvonshire to select a Welsh-speaking group for 
the purpose of standardizing certain tests (see Miles, 1953). From the question- 
naire scores children whose background is strongly Welsh can be distinguished 
from those whose background is mainly English or linguistically mixed. A 
Mm E of the questionnaire was provided for all who preferred to answer 
it in Welsh. 


IIL.—ANALYSIS OF TEST RESULTS. 


School attainments were found to differ considerably for each of the four 
attainment tests.[ There was a tendency for the school attainments to jncrease 


* See Schonell (1950), 71-77 and 87-95, for further details of these tests. 


f In one case, however, (English Comprehension) the differences just failed to attain 
significance at the 5 per cent. level (F=1-60, for 15 and 359 degrees of freedom). 
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with intelligence. In no case, however, was this increase a regular one. The 
correlation of each set of school attainment means with the corresponding 


intelligence means was as shown in the following table.* 


TABLE 1 


Compre- Usage Problem 
hension 


Arith. 


Correlation Coefficient 
(between attainment means 
and intelligence means) 


Differences in attainment between schools may clearly be considered as being 
dependent, to some extent, on the corresponding differences in intelligence. It 
seemed desirable, therefore, to adjust the attainment differences to make 
allowances for the school differences in intelligence. This was done by the 
analysis of covariance (see, for example, Lindquist, 1940, 180-207). The school 
attainment means were thus compared in relation to their intelligence means 
and the within-schools regression of attainment on intelligence. In the case of 
the English tests this comparison is illustrated by Figures I and II. It is seen 
that, in both tests, the dispersion of attainment means around the regression 
line is considerable. The dispersion, in fact, was found to be statistically 
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FIG. II. English Usage and Intelligence. 


significant (at the 1 per cent. level) for all four attainment tests. Differences 
in intelligence between schools do not account fully for their differences in 
attainment. 

In both the English tests schools 1 and 14 are seen to have done markedly 
better than might be expected on the basis of intelligence alone, the points 
representing these schools being well above the regression line in both diagrams. 
At the other extreme the points representing schools 7, 9 and 5, are seen to be 
appreciably below the regression line in both diagrams. Schools 4 and 3 may 
be noted as having a high adjusted mean* on Usage. School 8 yields the curious 
result of having done well on Comprehension, being third in the order of adjusted 
means and yet very poorly on Usage. All other means on the two English tests 
seem to be fairly close to the regression lines. 

In both the arithmetic tests schools 1 and 9 were found to have done 
considerably better than their intelligence expectation, i.e., n diagrams 
corresponding to those shown for the English tests the points representing these 
schools would appear well above the regression line. At the other extreme the 
attainments of schools7 and 2 were found to be markedly less than that expected 
on the basis of intelligence. Again, school 14 had a high adjusted mean on 
Mechanical Arithmetic and school 6 a low adjusted mean on Problem Arith- 
metic. Allother means in the arithmetic tests were fairly close to the regression 
lines. 

Thus, for the results as a whole, school 1 has done extremely well, its 
adjusted mean being actually the highest in all four tests. Although this school 
has the most intelligent group of 11-year-olds (its intelligence mean is the 
highest as may be seen from the position of the point representing this school on 
the extreme of the figures) so that high attainment means are to be expected, 
nevertheless, the actual attainment means are higher still. School 7 on the 
other hand, can be noted as having attainment means appreciably below what 
might be expected from intelligence. The results of the linguistic-background 
questionnaire can thus be considered to see whether they afford a partial 
explanation of these school differences. 

* i.e., high in relation to the regression line, which provides the basis of the adjustment 
for school differences in intelligence. 
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IV.—THE QUESTIONNAIRE RESULTS. 


Considerable differences in linguistic background would naturally be 
expected within each school, but a convenient index of the linguistic background 
of each school group as a whole is provided by the percentage of the group whose 
background, as assessed by the questionnaire, is strongly Welsh. In the report 
of the administration of the questionnaire in Caernarvonshire all who scored 
from +51 per cent. to -+ 100 per cent. were grouped together ‘ definitely Welsh ’ 
(Miles, ibid., 20). However, the writer, in a recent paper,* has put forward 
strong reasons for favouring a more narrowly-defined and homogeneous 
' Welsh ' group (+90 per cent. to 4-100 per cent. on the questionnaire). Con- 
sequently, the percentage of pupils for each school in this * Welsh ' group was 
taken as a linguistic index of the school. These linguistic indices—which can 
be regarded as measures of the ' Welshness' of background of the groups 
tested—are shown, together with their ranks, in the following table : 


TABLE 2 


School No. 


Linguistic 
Background | 20-0} 14-6; 15-1 
Index 


25-0} 41-6 | 36-2 | 43-7 | 50-0 | 31-2 | 24-2 | 18-7 


Index-rank | 11 


It is seen that schools 1 and 14, already noted as having high adjusted 
attainment means on both the English tests, have low index-ranks 11 and 16 
respectively. Again, schools 7, 9 and 5, noted as having low adjusted means on 
both the English tests, all have much higher index-ranks of 4, 3 and 5 respec- 
tively. It is clear then that schools which have done markedly well on the 

‘ch tests, i.e., noticeably better than would be expected on the basis of 
English il , tritelli ence alone, are schools with a low index-rank and, again, 
Seir Apes s E in English are noticeably below their intelligence 
sengol maras schools with a high index-rank.f In the same way, schools 4 
BAD pene sai noted as having high adjusted means on Usage havelow index- 
and 3, whic à T: pen ectively, while school 8, which has a surprisingly low attain- 
rario 10am t t ne a fairly high index-rank of 5. Possibly, too, the difference 
ment on this nals attainment on the two English tests can be related to its 
between this "n round index. If the usage of English is more dependent on the 
linguistic-bac "the comprehension of the language (which may be the more 
Home oani the school in strongly Welsh-speaking areas), then schools with 
dependent -: ks may reasonably be expected to show attainment differences 
high nt any rate school 8's shift in position from Comprehension to 
EE dum be contrasted with that of schools of low index-ranks (e.g. 3, 4 and 
14) S bid show no such shift. 
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In the arithmetic tests the tendency for high adjusted attainment means 
to be associated with low index-ranks and low adjusted attainment means with 
high index-ranks is not evident from inspection. Thus, while school 1, with an 
jndex-rank of 11, is at the top end of the attainment scale, so also is school 9, 
with an index rank of 3. Again, at the other extreme, schools 7 and 2, both with 
low adjusted means on both tests, have index-ranks of 4 and 14 respectively. 
No association between adjusted attainment means and the index-ranks thus 
suggests itself. 

However, the relative magnitude of these tendencies for all four tests can 
be assessed by correlating the adjusted attainment means with linguistic back- 
ground. Spearman's rank correlation, obtained by correlating the ranks of the 
adjusted means with the ranks of the linguistic-background indices, is shown 
below for each test. The standard error, calculated from the formula involving 
the equivalent Pearson r, is also shown (see Guilford, 1936, 341). 


TABLE 3 


Comprehension Usage Mech. Arith. Prob. Arith. 


—:51 —.55 —42 


18 26 


The first two of the above correlation coefficients are, numerically, considerably 
larger than their standard errors and may be regarded as signicantly negative. 
This is obviously not so for the last two. This confirms that the tendency for 
high adjusted means go with low index-ranks and for low adjusted means with 
high index-ranks is a marked one in the case of each of the English tests, but 
that it is only a slight one in the case of the arithmetic tests. 


One can conclude then that linguistic background can be regarded as an 
important factor of the school differences in attainment in English which are 
not explained by differences in intelligence. With respect to the corresponding 
attainment differences in arithmetic linguistic background is of less importance. 
There is some evidence of a tendency similar to that operating in English, but 
further research, with an analysis of results from a larger number of schools, is 
necessary before deciding whether this tendency, though small, is significant. 


Finally, it does not follow that the correlations of the actual school attain- 
ments in English and arithmetic with linguistic background are the same as 
those shown in Table 3. Although it would seem reasonable to expect intellig- 
ence to be independent of linguistic background, it must not be assumed that 
the actual measures of these used in this investigation are uncorrelated. Spear- 
man's rank correlation between the questionnaire and intelligence test (i.e., 
the correlation between the first and last rows of Table 2) is, in fact, — -29.* 
Thus, in proceeding from the actual to the adjusted school attainments, a 
component negatively correlated with linguistic background, has been removed. 


* This correlation, in the opinion of the writer, justifies a caution concerning the 
validity of the intelligence test when used as a comparative measure of ability for monoglot 
and bilingual children. Of course, as the standard error of the coefficient works out at -24, 
it could be claimed that the coefficient, as a statistic, is consistent with a parameter of 


zero (see, however, Lewis, 1959). 
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Correlations between the actual school attainments and linguistic background 
may, therefore, be expected to yield negative coefficients of even greater 


magnitude.* 


V.—SUMMARY OF CONCLUSIONS. 


(1) Attainment tests of English (comprehension and usage) and arith- 
metic (mechanical and problem) were given to all the 10-year-old pupils from 
sixteen primary schools in mixed-language areas in Wales. A non-verbal test 
of intelligence and a linguistic-background questionnaire were also given. 
The attainment differences between schools were considerable, though in the 
case of English comprehension, the over-all dispersion of school means was not 
significant. 

(2) These attainment differences were found to be related to the corres- 
ponding differences in intelligence (this relationship being noticeably less in the 
case of mechanical arithmetic, however). In all the attainment tests the 
School means tended to increase with intelligence, but not regularly. 

(3) The school attainments were adjusted to a common basis, as regards 
intelligence, by the analysis of covariance. The over-all dispersion of the 
adjusted school means was significant (at the 1 per cent. level) in all the attain- 
ment tests. Differences in intelligence between schools do not, therefore 
account fully for their attainment differences. 

(4) A linguistic background index was formulated to assess the ' Welsh- 
ness ' of background of the school groups. Schools whose attainments in English 
—adjusted for intelligence—were markedly high, were observed to have low 
linguistic indices (i.e., low Welsh background), while schools whose adjusted 
attainments in English were low had high linguistic indices (ie., high Welsh 
background). The tendency for high adjusted attainments to be associated 
with low linguistic indices and for low adjusted attainments with high linguistic 
indices was confirmed as being strong in both English tests, and was especially 
strong in English usage. (There is some evidence, too, to suggest that a rela- 
tively poor attainment on English. usage, as compared with English compre- 
hension, is also associated with a high linguistic = — 

i ic. on the other hand, the tendency for high adjusted attain- 
"s. iamen Be with low indices, and low adjusted attainments with high 


indices, is only slight. 
(5) Linguistic backgr 
with respect to those schoo 


ound may thus be regarded as an important factor 
1 differences of attainment in English which are not 


i intelligence. With regard to the corresponding attainment 
Ce v Ai testi linguistic background is of less importance, further 
research, being necessary to decide whether the slight tendency observed is real. 
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A CRITICAL STUDY OF BILINGUALISM AND NON- 
VERBAL INTELLIGENCE . 


Bv W. R. JONES 
(Department of Education, University College of North Wales, Bangor.)* 


Summary. Previous investigations undertaken in Wales have yielded some- 
what contradictory results concerning the influence of bilingualism on perform- 
ance in non-verbal tests of intelligence. More recent surveys, planned on a 
wider scale, consistently reveal a tendency for mean scores on such non-verbal 
tests to decrease with increasing Welsh linguistic background, and also show 
statistically significant differences in favour of monoglot (English) groups by 
comparison with bilingual (Welsh) groups. In the present study, reference is 
made to a re-analysis of the results of one of these large-scale surveys in relation 
to the parental occupations of the pupils tested. The findings of this re-analysis 
indicate that, whereas monoglot and bilingual groups which vary significantly 
in occupational class also differ significantly in non-verbal intelligence, corres- 

onding linguistic groups of comparable socio-economic status do not differ 
significantly in this respect. It is, therefore, concluded that bilingualism need 
not be a source of intellectual disadvantage. The importance of a thorough 
examination of socio-economic factors in any comparative study of monoglot 
and bilingual children is strongly emphasised, and a study of bilingualism and 
non-verbal intelligence which appeared in a recent issue of this Journal is 
critically examined on account of its defects in this and in certain other respects. 


I.—INTRODUCTION. 
Tue general trends in the literature dealing with the effects of bilingualism on 


intelligence warrant the conclusion that, whereas bilingual children suffer from 
a language handicap in verbal tests of intelligence, there is no evidence for a 
similar inferiority when their performance on non-language tests of intelligence 
is measured against that of monoglot children (Rumyaneck 1931, Arsenian 1937, 
and Darcy 1953). It is not surprising, therefore, to find that from about 1930 
onwards, researchers in Wales have used non-verbal as well as verbal tests of 
intelligence in their experimental studies of bilingualism. Barke (1933) found 
that bilingual children were, 0n the whole, slightly superior to monoglot children 
in a non-verbal test of intelligence, whereas m a verbal test administered entirely 
in English, the bilinguists were consistently inferior to the monoglots. Both 
abu T were thought to compare very fairly from the 


ilingual groups x ore 
T apibus a mental occupation, although further inquiry suggested that 
ine home environment of the bilingual group was somewhat superior to that of 


a subsequent investigation, Barke and Parry-Williams (1938) 
eee ae habe me addition, discovered that the superiority of the 
monoglot group was even more marked in a verbal test of intelligence given to 
each group in its mother-tongue. The conclusions of both investigations, 
however, are considerably weakened by the complete absence of proper criteria 
for evaluating the statistical significance of differences in mean scores between 
the various groups compared. Jones (1933) compared monoglot and bilingual 
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monoglot and bilingual groups on any of the tests within the scale. James (1947) 
as part of an enquiry into the general educational performance of monoglot and 
bilingual groups administered Form A of a non-verbal test of intelligence in 
English to both groups, followed by Form B administered in Welsh to the 
bilingual group and in English to the monoglot group. Group differences in 
mean I.Qs. were then evaluated by the t-test of significance, and the results 
showed that there was no significant difference between monoglot and bilingual 
groups of similar social background on either administration of this non-verbal 
test. Jones and Stewart (1951) were mainly interested in the influence of the 
verbal factor on the comparative performance of monoglot and bilingual groups 
in intelligence tests. They found highly significant differences in favour of the 
monoglot group on both non-verbal and verbal tests of intelligence. The 
difference in favour of the monoglot group in ‘ verbal intelligence’ remained at 
a highly significant level even after adjustment had been made by the methods 
of analysis of covariance to allow for the initial difference between the groups on 
the non-verbal test of intelligence. It was, therefore, concluded that the 
difference in adjusted verbal scores was probably due to the verbal factor in 
intelligence tests, which had seemed to work against the bilinguists in relation 
to the monoglots. The investigations of Jones (1933), James (1947) and Jones 
and Stewart (1951) show considerable advances, not only in the application of 
statistical techniques to the bilingual problem, but also in the adoption of various 
oa for quantitatively assessing the linguistic background of the children 
tested. 


II].— RECENT SURVEYS WITH NON-VERBAL TESTS OF INTELLIGENCE. 
The somewhat contradictory findings of earlier investigations with non- 
verbal tests have stimulated subsequent researchers to undertake surveys of the 


problem on a wider scale than was previously possible. Three recent surveys 
may be mentioned : 


(1) In 1951 the Collegiate Faculty of Education of the University College 
of North Wales, Bangor, undertook a survey in Caernarvonshire schools, when 
all children between the ages of 10 years and 11 years 11 months were given a 
series of tests. The main purpose of the survey was to standardise two intellig- 
ence tests on groups of children of varying linguistic background. One of these 
tests was non-verbal in content, but administered with oral instructions which 
had been adapted into Welsh for use with Welsh-speaking children. Particularly 
important for the work of standardisation was the Language Questionnaire, 
which had been specifically prepared in connection with the survey along the 
lines of a well-known Bilingual Schedule (Hoffman, 1934). Details concerning 
its construction, administration and methods of scoring have been published 
elsewhere (Miles, 1954). By combining Language Questionnaire Ratings and 
Head-teachers' Estimates of children's linguistic background, pupils were 
eventually classified into four linguistic groups as follows : (a) Welsh, (b) Mixed- 
Welsh, (c) Mixed-English, and (4) English. Separate statistical analyses were 
then carried out for the 10-year and ll-year levels. A striking feature in the 
results was the tendency for mean raw scores on the non-verbal test to increase 
as the linguistic composition of the group became progressively English. The 
over-all differences between the group means were also highly significant at 
both age levels.* More detailed comparison of the various group means by an 
application of the /-test of significance revealed highly significant differences 1n 

* The respective values of F were 11-89 (df=3 and 1,215), P<0-001 and 21:36 (df=3 
and 1,294), P. 0-001. (Appendix Tables C and D, Jones 1955). 
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4 favour of the English and Mixed-English groups by comparison with the Welsh 
group, and also a significant difference in favour of the English group by 
comparison with the Mixed-Welsh group. These differences were equally 
present at both age levels. In addition, at the 11-year level a highly significant 
difference was found in favour of the Mixed-Welsh group by comparison with 
the Welsh group.* A full account of this survey may be found in publications 
sponsored by the Collegiate Faculty of Education at Bangor (Jones, 1955 and 
1959). 

i In 1954 a panel of four researchers representative of the Collegiate 

Faculties of Education at Aberystwyth and Bangor investigated the educational 

attainment of monoglot and bilingual children between 10 and 11 years of age 

in relation to their intelligence and linguistic background. This survey was 


undertaken at the request of the Welsh Joint Education Committee which later 
published a detailed account of the investigation (Jones et al, 1957). In all, 
749 children were tested in primary schools in the counties of Anglesey, Caern- 


arvon, Merioneth, Cardigan, Pembroke and Carmarthen. The non-verbal test 
Fe of intelligence and the Language Questionnaire which were used in the 1951 
Bangor Survey were also administered on this occasion. The children were 
classified into the following linguistic groups, according to their scores on the 
Language Questionnaire : (i) Welsh (+50 per cent. to 4-100 per cent); (ii) 
Mixed-Welsh (0 per cent. to +49 per cent.) ; (iii) Mixed-English ( —1 per cent. 
to —70 per cent.) ; and (iv) English (—71 percent. to —100 percent.) Highly 
significant overall differences were found between the groups on the non-verbal 
test.t A recent statistical analysis of the data has also revealed a tendency for 
mean raw scores on the non-verbal test to increase as the linguistic composition 
of the group becomes increasingly more English, and further comparison of 
group means has indicated highly significant differences in favour of English 
and Mixed-English groups by comparison with the Welsh group.{ In effect, 
the results of this survey seem to corroborate the general findings of the 1951 
^ Bangor Survey. p 
(3) Morgan (1955) made a comparative study of the effects of varying 
degrees of bilingualism on the performance of children aged 10 to 12 years in 
certain non-verbal tests of intelligence. E synopsis of this work appeared at a 
later date (Morgan, 1957). He found a tendency for scores in the three tests 
to decrease with increasing Welsh linguistic background (p. 10). After com- 
ing the mean scores of three broad linguistic groups defined as Welsh, Mixed 
paring the ; d testing the mean differences for statistical significance, he 
and Englis à anc children with a predominantly Welsh linguistic background, 
"epa : un lot and near monoglot Welsh children, are handicapped by 
mes Mensis A non-verbal tests of intelligence.” (p. 14). His investigation, 
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although specifically concerned with the relationship between linguistic back- 
ground and test-performance, also afforded some evidence for the influence of 
other variables, such as urban-rural differences and socio-economic or cultural- 
educational factors. 


III.—THE IMPORTANCE OF SOCIO-ECONOMIC FACTORS IN ‘ BILINGUAL ' RESEARCH. 


From the unanimous findings of the three recent surveys which have been 
briefly summarised in this paper, one is tempted to conclude that bilingualism 
has an adverse effect on performance in non-verbal tests of intelligence. Such 
a conclusion, however, should not be finally accepted without further examina- 
tion of the evidence, particularly with reference to the influence of variables 
other than the bilingual factor on test performance. For this reason, the present 
writer has analysed the results of the 1951 Bangor Survey in the light of socio- 
economic data which had been made available to him through the co-operation 
of the School Welfare Officers in Caernarvonshire (Jones, 1959). Detailed 
information was thus obtained concerning the parental occupation of each 
child tested in the survey, and these occupations were subsequently coded and 
classified in a form comparable to the one used by the Population Investigation 
Committee.* On re-analysing the results in relation to the occupational data, 
it was found that various linguistic groups which did not vary significantly in 
occupational class did not differ significantly in non-verbal intelligence. Such 
groups were mainly located in the urban areas of the county. On the other hand, 
highly significant differences in non-verbal intelligence were found between 
corresponding linguistic groups situated mainly in the rural areas of the county, 
and such groups were also shown to differ significantly from each other in respect 
ofoccupationalclass.f It also appeared that these highly significant differences 
in non-verbal intelligence between linguistic groups in the rural areas could be 
adequately accounted for by occupational rather than linguistic variations 
between the groups, since variations of the latter kind were equally operative 
amongst the urban linguistic groups which showed no appreciable differences 
in non-verbal intelligence. It was, therefore, concluded that bilingualism 
need not be a source of intellectual disadvantage. Finally, this re-analysis of 
the 1951 Bangor Survey emphasised the importance of a thorough examination 
of socio-economic factors in any comparative study of monoglot and bilingual 
children. 

A recent study of bilingualism and non-verbal intelligence which appeared 
in this Journal (Lewis, 1959) is signally defective in this and in certain other 
respects. The study contains unacknowledged extracts from the data of the 
1954 W. J.E.C. Survey, already referred to in this paper-§ It is, indeed, regrett- 
able that the article itself contains no reference to the survey as the source of 
Lewis's data, so that the general impression left upon the reader is that the 
‘author is throughout describing and discussing his own original data. The 
complete data had been previously published under the names of the four 


* The Maternity Inquiry in Great Britain. Population Investigation Committee and 
Royal College of Obstetricians and Gynaecologists. Oxford University Press, 1948. 
t Significance of differences in non-verbal test score : 
F=1-53, df=3 and 663, P<0-200. 
Significance of differences in occupational class : 
—15-175, df=15, P0-70—0-50, 
t Significance of differences in non-verbal test score : 
F=24-78, df=3 and 1,719, P 0-001. 
Significance of differences in occupational class : 
x= 132-093, df=18, P<0-001. 
§ Supra, p. 5. 
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researchers who were solely responsible for planning and directing the whole 
survey, and Lewis's contribution on the statistical side had been duly acknow- 
ledged at the end of this publication (Jones et al, 1957). 

Adopting some of the data of the W.J.E.C. Survey, but using a different 
linguistic grouping, Lewis finds a real difference between monoglot and bilingual 
children in performance on a non-verbal test of intelligence, and suggests that 
this difference may be due, in part, to (i) the test being a timed one, and (ii) the 
groups not being equal with respect to urban-rural differences. He also suggests 
that the presence of a small but appreciable verbal factor in the non-verbal 
test as shown by Emmett (1949) could well prove to be far from negligible in 
assessing the intelligence of the bilingual child. 

To suggest certain characteristics of the non-verbal test of intelligence as 
partial explanations of the statistically significant differences in favour of 
monoglot children is not altogether convincing, since similar tests have been 

s without any disadvantage to bilingual children by 


used in other investigations w lva ; 
comparison with monoglot children. Indeed, it is particularly relevant to note 
in this connection that the non-verbal test of intelligence* as well as the corres- 


ponding Welsh versiont referred to in the author's study were also used in the 
1951 Bangor Survey with definite indications that groups of monoglot and 
bilingual children of comparable socio-economic status do not differ significantly 
in average performance on the test (Jones, 1959). There remains the third 
explanation suggested by the author, namely, the influence of urban-rural 
differences on test performance, due to “a slightly higher proportion of the Welsh 
speaking bilinguals coming from a rural environment than of English-speaking 
monoglots " (Lewis, 1959, p. 21). Studies which provide information about 
urban-rural differences do, in fact, indicate that rural children are found to be 
lower scorers than town children, but, when the analysis is carried further and 
pupils are grouped according to the occupational status of their family, the 
crude urban-rural differences tend to disappear. According to a recent survey, 
“a comparison of the urban and rural professional groups reveals that the 
difference between their mean scores 1s negligible; a similar comparison of 
rural and urban manual groups likewise shows little significant difference. 
However, when the mean score of the combined urban and rural professional 

oups is compared with that of all those employed in manual occupations, there 
ir c a significant. difference with the former achieving the higher mean 
appears (B: S 1959, p. 58). It appears that the wide differences in occupational 
P n l { set suall » exist between urban and rural parts of the country, may be 
isa aa don prominent amongst the various possible causes of 
regarde con One would, therefore, expect to find occupational as 
pe pac 1 urban-rural differences revealed in Lewis's analysis, but, rather 
well as broad ura * the groups of this investigation did not differ 

ingly, it is stated that e groups ¢ : 

Surprising y, jth regard to parental occupation " (Lewis, 1959, p. 21). No firm 
appreciably ered T support of this statement ; on the contrary, the author 
hen 25 xe that “ a more thorough analysis of the socio-economic factor 
hie mE vealed some differences " (Lewis, 1959, p.21). Such an inadequate 
might have reves me. ly unsatisfactory but also difficult 


re 3 » socio-economic factor is not oniy : i 
p po of the fact that precise information concerning parental 
cg inte was available to the author in the case of each pupil tested in the 


original survey. 


In conclusion Lewis's discussion of the Language Questionnaire also calls 
sion, 


fi mment. The possible range of scores on the questionnaire extended from 
or co 5 


; skins: A Scale o Non-Verbal Mental Ability (N.F.E.R.). pee 
1 k * Jones: Profion eo Aaah Di-iaith, Cyfaddasiad Cymraeg (N.F.E.R.). 
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+100 per cent. (predominantly Welsh) to —100 per cent. (predominantly 
English), and it was originally decided to separate off a group of children who 
were definitely Welsh (with scores from +50 per cent. to 4-100 per cent.), a 
second group of those who were definitely English (with scores from —71 per 
cent. to —100 per cent.), and a third group labelled ‘ Mixed ’ group (with scores 
from +49 per cent. to —70 per cent.). Lewis maintains that a Welsh group so 
defined will show considerable differences in linguistic background, and he 
consequently suggests a group more narrowly defined with scores ranging from 
+90 per cent. to +100 percent. Correspondingly, his English group is restricted 
to pupils with a range of scores from —90 per cent. to —100 per cent. The 
remaining children are divided into two groups with scores ranging respectively 
from 0 per cent. to +89 per cent., and from 0 per cent. to —89 percent. He also 
emphasises that the Welsh and English groups so defined are secured without a 
serious depletion of numbers in each group. Numerical considerations of this 
kind are not relevant to methods of linguistic classification, which are usually 
decided upon in accordance with principles related to the general linguistic 
patterns prevalent in the community. It is also essential that a decision 
concerning the methods of linguistic classification should be made well in 
advance of the investigation proper and without any preconceived ideas as to 
the adequacy of numbers in any given linguistic category. Strong reasons have 
been offered in support of the original method of classification adopted by the 
designers* of the Language Questionnaire and an examination of pupils’ 
answers has indicated the sort of basis on which their decision was made 
(Miles, 1954). In view of these reasons, other possible grouping plans, including 
the one used in Lewis’s study, were rejected by the four researchers of the 
W. J.E.C. Survey in favour of the classification originally proposed in connec- 
tion with the Language Questionnaire. Finally, it should be borne in mind 
that no single set of questions can adequately reflect the intricate and often 
contradictory characteristics of a pupil's linguistic background. At best, a 
Language Questionnaire is but an imperfect instrument, hardly suitable for 
discriminating between the finer shades of linguistic background. It would, 
therefore, appear that broad linguistic classifications are often more realistic 
and consequently more informative for practical purposes than any attempts to 
separate pupils into linguistically homogeneous sub-groups, artificially defined 
within excessively narrow limits of questionnaire scores. 


* J. Rogers, T. R. Miles, and W. R. Jones, 
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NOTES AND COMMENTS 


REPLY TO: A CRITICAL STUDY OF BILINGUALISM AND NON- 
VERBAL INTELLIGENCE (By W. R. JONES) 


By D. G. Lewis. 


One important difference between Jones and the writer concerns the selection 
of groups. Jones approaches this with pre-conceived ideas as to where the group 
boundaries should be; the decision, he maintains, should be made well in advance 
of the investigation proper. The writer, on the other hand, maintains that the 
question should be finally settled only when all evidence to be gained from the 
investigation is available. In particular, a detailed analysis of the questionnaire 
data should first be made in conjunction with the teachers’ ratings of linguistic 
background. In an analysis of the questionnaire data from schools in the Aberyst- 
wyth Faculty Area, the writer found that the broadly-defined “Welsh” group of the 
W.J.E.C. Survey contained a substantial minority of children (about 35 per cent.) 
from homes where Welsh was not always spoken. Sometimes only one parent (usually 
the mother) would be constantly speaking to the child in Welsh, and often there 
would be brothers and sisters with whom the child would converse mostly in English. 
Clearly differences of background within a group so defined are very considerable : 
the group boundaries are excessively wide. And this is particularly unfortunate 
since it is in a comparison of the two "extreme" groups (Welsh and English) that 
any effect of a bilingual background can best be assessed * Marked heterogeneity 
within the “extreme” groups is avoided, however, in the grouping adopted by the 
writer. This grouping, moreover, agreed well with the teachers’ ratings. Other 
tentative groupings, including the one favoured by Jones, did not show the same 
measure of agreement. 

Jones appears to misunderstand the reference to depletion of numbers. These 
numerical considerations, while not affecting the position of group boundaries, are 
by no means irrelevant. If by restricting the Welsh group to those of a thoroughly 
Welsh background, numbers are seriously depleted—so that a group mean score 
would involve an unduly large standard error —the remedy is not to widen the group 
boundaries but to test more children. As it happened, however, this was not 
necessary. . 

Jones’ other two criticisms merit but brief comments. A thorough investiga- 
tion of socio-economic status would involve other variables besides parental occupa- 
tion. The group comparison of parental occupation merely ensured there were no 
gross disparities. (Incidentally, the W.J.E.C. Report completely ignores the socio- 
economic factor. Finally, the original questionnaire and intelligence-test data had 
been made available by the Department of Education at Aberystwyth. The writer 


is glad to acknowledge the generous co-operation of all in the Department interested 
in this field of research. 


Comments on Dr. Bene's Article. 


Bv W. B. DOCKRELL 
(University of Alberta) 


In her article “ Suppression of Heterosexual Interest and of Aggression by 
Middle Class and Working Class Grammar School Boys," in the Journal of Novem- 
ber, 1958, Dr. Bene concludes that '' The present results indicate only that there are 
no social class differences regarding suppression of aggression." The data included in 
the article are limited and it is difficult to re-work the material, however, it does not 


* As pointed out by the writer (this Journal, 29, 20) this comparison is essentially one 
between a monoglot (English) group and a bilingual (English-Welsh) group. 
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seem that this conclusion is supported by the evidence presented. The differences 
between the two groups are small on some of the items designed to indicate suppres- 
sion of aggression, but not so small that one would accept the null hypothesis. 
None of them are large enough for one to reject the null hypothesis, but not rejecting 
and accepting are two very different matters. This may merely be a slip in wording, 
but there are no doubt readers who took this clause literally. 

. Without further information, it is difficult to decide whether the data presented 
might not be interpreted to support the original hypothesis. One would wish to 
know rather more about the sample than can be learned from this article. For 
instance, Dr. Bene refers to four grammar schools but describes only three. A 
grammar school drawing predominantly working-class boys might convey middle 
class standard rather less effectively than a mixed or predominantly middle class 
one. Similarly, one would wish to know more about the sample of modern school 
boys referred to in the discussion. : 

The form of the testing instrument itself also makes re-interpretation difficult. 
What was the precise form of the items used to indicate suppression of aggression ? 
At least two were open ended stories where the endings were classified as non- 
aggressive in terms of overt behaviour; non-aggressive in terms of thoughts and 
feelings; and what else? How many of the other endings were aggressive, not 
completed, non-classifiable or characterised in some other way ? The other six items 
scem to call for a straight agreement or disagreement but how many ' don't knows ' 
were there and were the ‘don’t knows’ evenly distributed between the groups ? 
If there were three or more classifications of responses, aggressive, non-aggressive, 
don’t know and some others, were the differences in numbers of aggressive responses 
not significant ? i P. 

In analysing the data, Dr. Bene tested each item for significance individually. 
From the data given, it would seem reasonable to divide the questions into projective 
items of feeling and descriptions of preferred behaviours. The two ' projective " 
items are counted twice because in both cases two different classifications of response 
are labelled non-aggressive. Can the two groups of non-aggressive responses to the 
story about Fred be combined and contrasted with the aggressive responses ? If we 
combine these two groups and contrast them with the responses not described in the 
article, the X? exceeds the -05 level for a one-tailed test. The other projective item, 
the story about Bob, is the only item in which the proportion of non-aggressive 
responses from the working class boys exceeds that from the middle class boys, 
so that if the responses from the two projective questions are combined, the 
difference between the two groups 1s not significant. On most tests we expect the 
total score and not the individual items to discriminate. It is not clear from the 
article whether it would be legitimate to combine items in this case. If, however, 
we do add the ' behavioural ' items, we find a X? greater than -025 for a one-tailed 
nis In her discussion, Dr. Bene refers to other uses of the same instrument as 
evidence of its validity. Were any investigations made into its reliability ? Were 

for this is cited as a relevant variable ? 


the two groups matched for family size, a r 
At most Dr. Bene's findings are inconclusive. She failed to find differences as 
large as, or of the same kind as, she had found between middle-class boys who had 


several siblings and those whose life evolved mainly round adults—though nine of 
the ten differences between the social classes were in the direction hypothezised. 

More extensive testing with the same instrument might yield reliable differences 
between the social classes, so might a different form of investigation. In the mean- 


time, we must suspend judgment. 
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Reply to the Comments by Professor Dockrell. 
By EVA BENE 


I would like to start my reply to Professor Dockerell's comments by stating 
that I agree that an investigation into social class differences in England which 
uses different techniques from those I used, might yield different results. We can 
never be certain that research results correspond with reality but only believe 
with a greater or lesser degree of confidence that they do. I hope that not many of 
my readers have understood me to claim that the fact that I did not find a significant 
difference between middle class and working class grammar school boys regarding 
suppression of aggression definitely proves that such a difference does not exist 
between the social classes in England. 


The objections Professor Dockrell raised against my paper are to some extent 
semantic in origin. They partly hinge on the meaning of the word ' indicate.' While 
I used this term in the sense of ‘ suggest, he appears to have understood it in the 
sense of ' prove .' It would be interesting to know which meaning is more frequently 
attached to this word when it is used in the phrase: “ The results indicate." 
Another difficulty of a semantic nature has arisen from the circumstance that 
Professor Dockrell attached great Significance to the meaning of half a sentence 
taken out of its context. His main objections are to the following phrase: “ The 
present results indicate only that there are no social class differences regarding 
suppression of aggression." This sentence continues in the following manner: 
'"... they do not show to what extent both groups of boys express or suppress their 
aggressive feelings." Professor Dockrell thinks that the phrase which he quotes is 
misleading. Maybe it is, but so is taking it out of its context which gives its meaning 
à different slant, and so is ignoring the next sentence which begins with the words : 
*' Tt 8eems-. 2 T 

Now to the objections on technical grounds. Professor Dockrell is correct in 
saying that I referred to four, but only described three grammar schools. The 
sentence describing the grammar schools Should have read : ‘‘ One school was located 
in a predominantly working class district, one in a predominantly middle class 
district, and two in districts in which both scoial classes were about equally 
represented." Unfortunately, part of this sentence was left out, but fortunately, 
the paper contained references which should enable any interested reader to find out 
more about the sample. 

Professor Dockrell's next objections have to do with the story completion items. 
In the case of these items each subject had to give two separate responses. He first 
had to say what he thought the hero of the Story did in the situation, and then what 
he imagined the thoughts and feelings of the hero were. For some reason, Professor 
Dockrell thought that the responses to each of these items were counted twice when 
it was not legitimate to do so. Yet he combined them and worked out a Chi square 
on the basis of the combined responses. Since he only had the percentage figures 
published in the article at this disposal, it is quite mysterious how he went about his 
calculations, and the Chi square he gives appears to be meaningless. The same holds 
for his calculations of the other Chi Square. These Chi squares are meaningless not 
only for statistical reasons but also because i 
which are different in kind. He could have Seen from the table that some of the 
' behavioural ' items were statements and others were not. 


. Professor Dockrell also states that "... nine of the ten differences between the 
social classes were in the direction hypothesized," whereas it can be seen from the 


table that one of the ten comparisons showed no difference, seven went in the pre- 
dicted and two went in the other direction. 


However, Professor Dockrell also raises Some interesting points. He would 
like to know how many of the non-aggressive responses were aggressive, how many 
non-classifiable, how many not completed, etc. I can understand that a reader who 
is really interested in a piece of research would like to see all the data. The reason 
why it is not given is that this would result in papers which are too long and contain 
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into ee tables for publication. He also asks whether any 

€ validity of the items used. The question of the validity of the instruments 

two. Alllcan give as an answer 


ban Top complex to be answered in a sentence or t : Y 
the in d is almost impossible to validate psychological tools which have to do with 
uo o ee oe of emotions, at least to validate them in à straightforward 
with whi The reason for this is that there are no valid external criteria available 
which one could compare the obtained results. 
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Finally Professor Dockrell raises the question whether it is 
expect individual items instead of total scores to discriminate. The answer again is 
yes. Both methods have their advantages and disadvantages. Some of the 
advantages of working with separate items are the following: One does not lose 
sight of the results obtained by individual items. One can use different types of 
items and thereby eliminate the danger that the responses are too much influenced 
by the manner in which the items were phrased. One eliminates the difficult and 
often unanswerable questio uch weight should be attached to the different 
one can draw conclusions on 
between the responses 


n of how m 
responses. Regarding the interpr 
asis of the difference 


proper in research to 


the basis of the pattern cance of the differences 
of two groups to a number of items just as one can on the b 
between the means of two groups. 

I would like to finish my reply to Professor Dockrell by stating once more that 
I believe as firmly as he does that the fact that in an investigation no significant 
differences were found does not prove the null hypothesis to be true. 
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RESEARCH NOTE 


The Peabody Picture Vocabulary Test with English Children.! 


By JAMES W. MOSS, Ph.D.* 
(Medical Research Council, Maudsley Hospital, London) 


AND PHYLLIS EDMONDS 
(Institute of Education, University of London) 


The present study was conducted to provide preliminary data on the utility 
of the Peabody Picture Vocabulary Test? (PPVT) with English children. The PPVT 
is an instrument recently developed in the United States which purports to measure 
comprehension of the spoken word, including the ability to associate verbal symbols 
with pictorial representations of objects, events, and actions. The test consists of 
150 plates arranged in ascending order of difficulty. Each plate contains four 
pictures: three decoys and a correct one. The same set of plates is used for both 
forms of the test. The test is administered by having the child indicate which picture 
on the plate best illustrates the meaning of the orally presented stimulus word. 
No particular form of response is required. The subject is tested from a basal of 
eight consecutive correct responses to a ceiling of six errors out of eight consecutive 
responses. 

The PPVT was standardized on random samples of normal children from 2 years 
6 months through 18 years of age, but it was specifically designed to be useful with 
handicapped subjects. The test is not timed. The comparatively large drawings 
make it of value with some partially-sighted children. 1t may also be used with 
cerebral palsied and speech-handicapped children. 

The PPVT was administered to 101 English school children between the ages of 
six and nine from two different London County Council schools. The Otis Group 
Intelligence Scale, Primary Examination, was administered at the same time. It 
was found, with the age range covered, that the PPVT basal and ceiling levels of the 
children generally fell between plates 41 and 90, inclusive. For the item analysis, 
therefore, only the responses to these plates are considered. Only those children in 
the sample with I.Qs. 90 to 110, inclusive, were involved in the cross-cultural 
comparison of PPVT scores. The sample was divided into two age groups 1n order 
to increase the reliability of the results. The high age group (N—21) ranged in 
CA from 7:6 to 8: 5 with a mean CA of 8:0. The low age group (N=17) ranged 
from 6:3 to 7:5 with a mean of 7:2. No differences were found between the 
mean scores or the variances of the English and American children at either age level. 
The correlation between the PPVT and the Otis Scale was 0°68. With the effects of 
CA partialed out, the correlation was reduced to 0-64. 

The pictures in the PPVT are, in general, appropriate for English children, 
although there are a few drawings, such as the American policeman, baseball bat, 
or coins which might cause difficulty. Some of the words, or word-picture combina- 
tions are inappropriate resulting in a change in the difficulty level for the particular 
item. The present arrangement of the items is not wholly appropriate for use with 
English children as it does not represent a smooth, ascending order. 

The general conclusions reached by the present authors is that the PPVT is 
potentially a useful instrument for use with English children. Although it can 
evidently be used in its present form with the American norms, it should be re- 
standardized for English usage. 


! This project was supported in part by Research Grant MF 8599 fr. National 
Institute of Mental Health, Bethesda, Md. An extended report of this osc wey Da 


obtained without charge from James W. Moss, San Francisco State College, San Francisco, 
California. " 


? Now at San Francisco State College, San Francisco, California. 


3 Dunn, L. M. Manual, Peabody Picture Vocabulary Test, Te i iti George 
Peabody College for Teachers, Nashville, Tennessee, 1959. ERA ORCS 
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SUMMARIES OF RESEARCHES REPORTED IN DEGREE 
THESES 


Sources of Fluctuation in Scores from Successive Group Intelligence Tests. 


By NORMAN W. KING 


(Abstract of a thesis submitted in part-fulfilment of the requirements for the Degree of 
Bachelor of Education at Aberdeen University, in 1959.) 


AIM, 
To examine the incidence of discrepancies of ten or more points of 1.Q. from 


administered within an interval of one week, and to discover 


two parallel group tests m 
vidence from the test booklets indicates 


the proportion of such cases where internal e 
one or other as the ‘ correct ' score. 


SUBJECTS. 


In a year-group of 2,768 Aberdeen children aged 11, there were 194 cases where 


discrepancies of ten or more points were found between scores from two successive 
intelligence tests (after practice effect had been removed by re-standardizing the 
scores in each test to a mean of 100 and a standard deviation of 15 in the year- 


group). 


TESTS. 
Moray House 1 
interval; previous practice an 


Test 49. 


Verbal Reasoning Tests 55 and 56 were given, with one week's 
d limited coaching having been given on Moray House 


METHOD. K : 
The 194 cases were classified from a study of the test booklets into four groups : 
where raw score on both tests was over 90 out of 100 items, 


(a) Ceiling cases : } 
and the discrepancy arose through inadequacy of the standardization at the 


highest limits (20 per cent.). 
(b) Slow performers : where one of the two tests was unfinished, at least 


fourteen items being left unanswered (8 per cent.). 
(c) Block errors : where, amid a run of correct items, a whole section was 
wrong because the instructions had been misunderstood (17 per cent.). 


(d) The remainder (55 per cent.). 


RES discrepancies of 10 points or more in 108 boys and 88 girls (7 per cent. of 
the group), constituted a hi her incidence than would have been predicted from the 
reliability coefficients o: the tests (slightly over 3 per cent). There were eight cases 
or more. Nine-tenths of the discrepancies showed an 


di neies of 20 points 
eens of the two Fores) above the group mean of 100. 


The large proportion of ceiling cases was partly the result of the accumulation 
of practice and coaching, the mean score in the tests before re-standardization being 
104-9 in the first and 108-4 in the second test. 

An attempt was made to determine the ‘correct’ score for group (d), where there 
was no clear internal evidence of the source of the discrepancy, using as a criterion 
the mean of scores in English and Arithmetic tests and scaled teachers’ estimates. 
In half the cases, the mean of the two I.Qs. came closest to this criterion ; in the 
other half, in approximately equal numbers, the higher or the lower score was nearer 


to the criterion. 
T ION. r ae 
kd" ~ discrepancies of 10 or more points occur, it is recommended that the 
* For ceiling cases, slow performers and block errors, the 


ined. : i 
test booklets deri om and this would deal with nearly half the cases. 
higher score should be acipta a a e must bS taken as the best available 


For the remainder, the mean o! 
estimate. 4 
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A Factorial Study of the Relationship between the Child’s Vocabulary and 
His Reading Progress at the Infants’ Stage. 


By ERIC POTTS 


(Abstract of thesis submitted for the degree of M.Ed. in the University of Manchester, 
April, 1959.) 


AIMS. 

In the literature concerned with the preparatory stages of reading, teachers are 
often urged to take steps to extend their children’s vocabularies. The authorities 
aver that the child with the more extensive vocabulary is better equipped to tackle 
the printed word. However, there is little published evidence obtained from children 
in the early stages of reading, to support this view. The aims of this research were, 
therefore : 


(1) To test the hypothesis that an extensive vocabulary predisposes a child to 
early success in reading. 


(2) To analyse the factors underlying a battery of tests chosen for their assumed 
relevance to the measurement of reading attainment and vocabulary know- 
ledge. 


THE SAMPLE. 

One-hundred-and-seven boys and ninety girls, aged between 72 and 78 months, 
drawn from eight infants’ schools, performed all the tests of the present battery. 
The distribution of intelligence quotients in this sample agreed closely with the 


normal frequency curve, and their parents’ occupations were typical of an urban 
area of South-East Lancashire. P pP 


METHOD. 

A multiple-choice pictorial vocabulary test was constructed in order to obtain 
measurements of the comparative sizes of the children’s vocabularies. It was 
assumed that this estimate of vocabulary extent would be directly proportional to 
the absolute vocabulary of the subject. A Specially adapted sampling technique had 
to be employed for selecting the items, as Sait wove from certain Thorndike 
frequency categories were considered suitable for ih age group. This test, presented 
in two parallel forms, was found to have a reliabilit: of -923. Two other tests 
were constructed, one to measure the child's ability to Armina visually, and the 


other, composed mainly of nonsense words, to assess ability to read regular phonic 
units. ic 


The complete battery of tests and the assessments used were as follows : 
(1) Chronological age. 

(3) The Burt-Vernon Word Reading Test 

e A pictomal = of Visual Discrimination 

e Moray House Picture Intelli Tes 

tn Net school attendance. ntelligence Test (Mental Age). 
7) The Schonell Silent Reading Test A 

(8) A test of‘ Phonic ' ability. ij 

) Social and Cultural Environment rating. 


STATISTICAL ANALYSIS. 


The differences in means in reading score (Burt-V. 


mental age, for school and sex groups 
‘t’ ratio. Broups were tested for 


ernon), vocabulary score and 
significance by means of the 


The inter-correlations of the tests and other th 

assessments were calculated by the 
product-moment formula. The resulting correlation matrix (Table 1) was submitted 
to factorial analysis by the centroid method. 
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TABLE 1 
THE CORRELATION MATRIX. 


Variable 


Chronological Age... - 
(2) Vocabulary Score ...- 
(3) Burt-Vernon Reading 


COE & iin gay UM OSSA 
(4) Test of Visual Di 
crimination...... 548 |--077 | -443 | :330 279 
(5) Morey House Mental 
PURE. s apni mp acest ‘056 | -590 | :55 ks 
(6) Net School Attendance “149 p -204 
(7) Schonell Silent Read- 
ing Test A ....-- 783 | 381 
H 414 


(8) Test of Phonic Ability 
(9) Environmental Score 


ere tested for significance by four independent criteria, 


The extracted factors were t 
til interpretation became possible. 


and subsequently rotated un 


Tue RESULTS. E 
Highly significant differences were found among the school means for the 


Burt-Vernon reading test. These were accompanied in a few cases by parallel 
differences in school means for mental age, but another important cause emanated 
from variations in school policy on the introduction of ' phonics.’ As the Burt- 
Vernon test appears to favour children with considerable phonic skill, the schools 
where phonics were introduced early in the reading programme achieved higher 
mean scores on this test. Morecver, the fact that the children in these schools were 
able to ' read ' by purely mechanical means reduced their reliance upon understood 
vocabulary. Thus, the correlations between vocabulary score and Burt-Vernon 
reading score, in the cases of these schools, ranged only from :3tozero. In contrast, 
for the schools in which phonics are deferred to a relatively late stage in the reading 
fficients were of the order of :6. 


programme, the corresponding coetiic! 
The full sample correlation coefficients between vocabulary score and reading 


scores on the Burt-Vernon and the Schonell tests were -34 and -44, respectively, 


despite the intrusion of the purely mechanical means of reading. 
Three significant factors were obtained from the correlation matrix. After 


rotation these were identified as follows : 
I ‘g, which contributed over half the common variance of the test battery. 
Th * Maturity-schooling,' which was concerned mainly with the two measure- 
ments of time, viz., chronological age and net school attendance. 
TIL A factor confined to the three tests of reading, given the title ' phonic 


reading." 
The rotated factor lo: 
Pearson zero coefficient are gr 


adings which are greater than three standard errors of a 


iven in Table 2. 


TABLE 2 
SIGNIFICANT LOADINGS—ROTATED FACTORS. 


Variable 

(1) Chronological Age ... = 787 ae 658 
(2) Vocabulary Score . 684 — — 471 
(3) Burt-Vernon Reading Score 556 230 “666 +805 
(4) Test of Visual Discrimination -575 — -343 
(5) Moray House Mental Age a 927 — — -871 
(6) Net School Attendance ..---- sa = 771 = -596 
(7) Schonell Silent Reading Test ... DE — -566 -760 
(8) Test of Phonic Ability ..| -567 = 647 pa 
(9) E vironmental Score. - ....) 7440 - -357 E 
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The vocabulary test's common variance was contributed only by the ‘ g ' factor. 
The relationship of this test with the two standard reading tests was found to be due 
to the common influence of the' g ' factor. ' Phonic reading ' also made a significant 
contribution to both of the reading tests, but, as this factor is not strictly orthogonal 
to the ‘g’ axis, the indications are that reading attainment is more dependent 
upon general mental ability than the loadings of the reading tests on the ' g ' axis 
suggest. 

Thus, the evidence indicates that, at the six-year-old level, reading attainment 
and vocabulary extent are more in the nature of different manifestations of general 
mental ability than distinct, mutually contributive abilities. However, this finding 
does not necessarily detract from the value of including in pre-reading programmes 
activities directed towards increasing the child's vocabulary. 


Factors Influencing Musicality. 


By P. N. P. WILLIAMS 
(Abstract of a thesis submitted for the degree of B.A. at the University of Reading, 
July, 1959.) 


The aim of this investigation was to determine whether senior ' approved ’ 
school boys differed from non-delinquents in their degree of musicality or feeling for 
music. Music may well afford an important outlet for sublimated anti-social drives. 
It would seem likely, therefore, that unmusical people, who have this channel for 
sublimation restricted, would be more prone to delinquency. 

It was first necessary to find a measure for musicality. Wing in his “ Tests of 
Musical Ability and Appreciation,” lists at least fifty tests which have this aim in 
view. On close inspection, however, these tests would appear to measure either 
memory, intelligence, musical knowledge or aural acuity. Apart from one or two 
tests in certain batteries, for example, Wing's, the emotional aspect of music is 
ignored. It is hardly surprising that these tests have been vigorously criticised by 
musicians. Lowery in his book The Bachground of Music, states that Mursell, and 
others, obtained little evidence of correlation between performance in the Seashore 
tests and musical talent as revealed in school or college work. Mursell emphasizes 
that music is a unity and that analysis incidental to psychological testing destroys 
the music. 

It was decided that a new test must be constructed which would overcome these 
deficiencies. As a piece of music is a ' gestalt ' it was decided to use actual works by 
composers of acknowledged worth. Twenty extracts were chosen from such works 
and each item was performed twice on a piano and recorded. 

One of the two performances was made as musically acceptable as the ability 
of the author as a performer allowed. The other performance was made deliberately 
less ' musical.' For example, the performance might have been ' wooden,' accents 
might be misplaced, or part playing obscured. After hearing each pair of perform- 
ances, the subjects were asked to choose which they ‘ liked better.’ 

The test was administered to ten professional musicians. In seven items tw 
of the ten musicians made an ' incorrect ’ choice, in five i f rrec 
i i Ini : 2 ] ve items one made an inco 
choice, and in the remaining eight items, no ' unmusi ' hosen. 

This group had a x ical performances were c 

3 group mean score of 18-1 and a standard deviation of 2-05. These 
ten musicians were matched with ten non-musicians of comparable age, sex an 


"x -- The non-musicians had a mean score of 13-4 and a standard deviation 
of 2-29. 


Fifty-four subjects were also aske 
of the National Anthem. The mean tes 
task was 16-00 as compared with a m 


d to attempt to identify a disguised version 
t score of forty subjects who succeeded in this 


E : s i ean score of 11-64 for the fourteen subjects who 
failed to identify the music. These results were subjected to a‘ t ' test ane a value 


for 't' of 5-86 was obtained. This is signi 
: i gnificant at the 0-1 per cent level. 
Subjects taking the test were also given A. W. Heim's HÀ intelligence test 
and asked to fill in a questionnaire dealing with their musical background. They were 


* 


| 
| 


- 


| 
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asked for example, " How often do you go to classical concerts? " From this 
questionnaire a musical background rating was obtained for each subject. 

As a measure of reliability twenty-two subjects, chosen at random, were 
retested after two weeks and their scores on the two occasions were found to correlate 


to the extent of r=0-81. 


RESULTS. 
Using normal subjects of ages between 14 and 68, the scores on the test ranged 


from 4 to 20. The mean was 11-73 and the standard deviation 3:43. 
Table 1 shows product moment correlations. 


TABLE 1 
n=127 


Test Score 


Test Score 


Musical Background Rating . E " -507 


Intelligence sd é S -231 
“324 


There was no significant difference in scores between the sexes. 


THE DELINQUENT AND CONTROL Groups. 

A group of thirty-nine senior ‘ approved ' school boys were compared with the 
same number of controls of similar age, sex and intelligence. The scores of these 
two groups are shown in Table 2. 


TABLE 2 


Delinquents | Control Group 


Mean Test Scores 9-95 10-44 
Standard Deviation of Test Score .... i 3.119 2-318 
Mean M.B.R : " +72 3-820 
i 3:441 3-136 


S.D. of Musi 


A't' test was carried out on the test score results and a value was found for 
«4^ of -781, which is is not significant. : 

x Ea “ et ce the musical background ratings of the two groups gave a value for 
© +? of 3-845 which is significant at the 0-1 per cent. level. 

EF: NCE FOR THE SECOND PERFORMANCE. 

Pass cacly all subjects showed a preference for the second performance, although an 
equal number of first (A) and second (B) performances were ' correct.' This prefer- 
ence was not, however, shown by the ten musicians or by the retest group. 


TABLE 3 


“ A" Choices | “ B" Choices | Chi Squared 


poscis V RU 103 97 -09 
10 Musicians : 306 474 18-1 


39 Delinquents - x EN 291 489 25.1 
: 1053 34-0 
222 -018 


39 Controls +- me z 2 
88 Normals dud y; m 
22 Retests 
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Chi squared is not significant for the Musicians and Retest group. On the other 
hand, the preference shown by the Delinquents, Controls and Normals for the second 
performance is significant at the -1 per cent. level. 


Discussion OF RESULTS. 

Perhaps the main outcome of this investigation has been to show the possibilities 
of a test of musicality constructed on these principles. A pilot study conducted by 
the author showed that the differences in the two styles of performance had to be 
comparatively coarse if valid results were to be obtained. 

Table 1 shows that intelligence has little association with musicality as measured 
by the test. This is contrary to Spearman's findings. 

The correlation of r= -507 between test score and musical background rating 
would appear to give support to the teaching of musical appreciation in schools. It 
may well be, however, that people with a predilection towards music ensure that 
their children have a good musical background. 

The preference for the second performance may be due to the fact that the 
subjects are already familiar with the music on hearing it for the second time. 
This hypothesis is supported by the fact that this preference is not shown by the 
musicians or retest group who would already be familiar with the music. 

It might be expected that the higher musical background ratings of the 
delinquents would be reflected in a higher score on the test but this does not occur. 
The delinquents were asked to fill in the questionniare as though they were still at 
liberty. Their higher musical background rating was completely unexpected by the 
investigator. It included all forms of music—classical as well as Jazz. It might be 
argued that the delinquents were trying to please the investigator by claiming to 
be more interested in music than was, in fact, the case. But, judging by the 
questions they asked about filling in the questionnaire, this did not seem to be so. 
Again the only item on the questionnaire where their total score was less than that 
of the controls was the question relating to private music lessons. This would 
appear to reflect a difference of economic Status between the two groups. 

At present it would appear that More research is required to confirm this 
interesting result and to find for it an acceptable explanation. 


par 


-a 


BOOK REVIEWS 


Burr, C. (1959). A Psychological Study of Typography. Cambridge University 

Press, pp. xi +67, 15s. uu 

This book directs attention to the part played by modes of printing and page 
arrangement in the broad field of ideational communication. The author is first 
concerned with problems of type legibility both among children and adults. Con- 
tinuing his work with London school children, Sir Cyril investigates certain 
characteristics of type, including size, form and boldness of letters, width of line, 
spaces between words and leading: unfortunately, though, no evidence is available 
on the effects of type on prolonged reading. However, a useful table of recom- 
mendations for school text-books is presented. Later, a study of aesthetic typo- 
graphical preferences in adults is described, including correlations of these prefer- 
ences with those shown by the same persons in allied fields of pictorial and literary 
Pres ho author draws attention to the provisional nature of experimental results 
based on small numbers of persons. But psychological statisticians will note, with 
considerable interest, the complementary use of P and T-techniques on a single set 
of data, finding here a most useful example of the reciprocity principle. It is a 
matter of conjecture whether clearer results might not have been obtained by 
separate factor analyses in two groups representing ‘ arts ' and ‘ science ’ interests. 
For the small general factor in the T-technique is an awkward partner and tends to 
obscure the main structural pattern. 

Educational psychologists will appreciate that here is groundwork for an aspect 
of research frequently assumed in group comprehension tests, and thought could 
well be given to the extension of these ideas. Nevertheless, confusion could arise 
between the relative effects of type, reading ability, general ability and the ability 
to comprehend (in the sense of answering questions following a passage). In this 
study groups of children are matched for general intelligence and reading ability, 
and tested by means of comparable comprehension test material, in order to estab- 
lish the parts played by the legibility of different kinds of page type and arrange- 
ment. How far the ability of each individual to comprehend (in the above sense) 
has been structured and included in this work is a matter which merits careful 
thought for future investigators. D. M. LEE. 


Jones, W. R. (1959). Bilingualism and Intelligence. University of Wales 
Press, pp. 67, 4s. 6d. 


Mr. Jones's monograph brings out the important point, which had been 
insufficiently realised by most previous workers on problems of bilingualism, that 
the apparent inferiority of bilingual to monoglot children or adults in (non-verbal) 
intelligence is often attributable to the tendency of monoglot families to be of rather 
lower socio-economic status. He provides a useful and critical survey of the large 
number of researches that have been carried out in Wales into these problems, 
and demonstrates from his own data the effects of this socio-economic factor. 


SKINNER, B. F. (1959). Verbal Behavior. London: Methuen, pp. x+-478, 42s. 
B. F. Skinner’s latest book is as complete a behaviourist account of the individ- 
ual's verbal life as one could imagine. In order to achieve this tour de force he has 
stretched traditional behaviourism well beyond some of its earlier limits, as in his 
stimulating account of * autoclitic ' or self-commenting verbal behaviour required 
to account for such troublesome elements as intention, attitude, assertion, composi- 
tion and thinking. Quite apart from its qualities as analysis of verbal behaviour, the 
book reveals its author s intimate knowledge of a vast range of literary material. 
His technique is to describe the behaviourist model in terms solely of a speaker 
onding by his utterances to internal or external stimuli in the presence of a 
Teu listener, through whose reactions language is built up in the speaker. 
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After illustrating the model, he then Studies the action of all sorts of variables upon 
the verbal responses of the speaker drawing upon literature and speech in different 
settings. 

As a thoroughgoing behaviourist he rejects suggestions that we need to invoke 
the independent entities of ' meaning,’ ' idea ' or‘ image.’ A word is a human habit. 
Verbal behaviour can be explained like other behaviour by the principle of con- 
tingency of stimulus, response (by the speaker), reinforcement (by the listener or 
audience). Speech begins by noises that are reinforced to certain stimuli. These 
stimuli are internal needs at first and give rise to demanding speech (ATands). 
There are also other verbal responses—echoic—textual and intraverbal. These 
responses are made to other verbal material. Descriptive responses to objects 
revealing contact with the outside world are called tacts, Finally, tacts may become 
extended by which a speaker Tesponds in a consistent way to common properties 
of similar objects as exemplified in the tact, pen. This allows for the richness of 
vocabulary, for concept formation, metaphor, solecism and the like. The exten- 
sions obviously. incorporate the speaker's past history. The fundamental condition 
TS verbal behaviour relates to the stimuli which gain control of the speaker's response. 

lagrammatically, we may extend Skinner's three-term contingency as follows : 


Si stimuli, 


: Sv other verbal material 
internal needs 


Spea 
RE Sa 
audience responses, 
Verbal stimuli to the speaker, 
response 


Se 
external objects 


Sh speaker's history 


The mand accounts for the 
is under control of Se. 


This is not go, d rbal ivi d 
n t] Bood enough for most of human verba activity an 
in the fourth Part of the book he takes up the higher forms of verbal activity shown 


rem VI, Skinner assumes firstly a listener who 
to autoclitic activity, and secondly. a 5.558, One based upon the other," giving rise 


giving rise to editing and thinking oro e who can react to himself as a listener, 
We must ask what appeal this wil] 


teachers. What kind of a rea ides wil ihi dde to educational psychologists and 


[ a OTOUS re-opening of y problems met 
in the school learning of language. Its fully behaviourist eise d,  particutang 
in the first three parts of the book, so well epitomised in the two quotations on pages 


| 


Mr 
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44 and 45, will challenge and stimulate readers even if they are not prepared to 
accept this wholesale throwing over of such linguistic currency as * ideas ’ and ‘ mean- 
ing.’ In asserting à standpoint that depends almost entirely upon learning, the 
book brings home to the reader the need always to be on the look-out for the 


inculcation of correct verbal habits. 
h of use in it for the students of linguistics, 


As a more specialist text, it has muc 
semantics, logic and English language. These readers, particularly if they know 
something about behaviourism, will probably have to ask themselves in the last 


two parts of the book how far Skinner has been able to adhere to the behaviourist 
limitations which mark the work of the experimentalists like Hull, Spence and 


Skinner himself in earlier writing. : 
The specific and more technical help that this book might give to the educa- 
tional psychologist and practising teacher is, perhaps, more limited, for many of the 
qualities of composition, editing, speech and other forms of verbal activity appear in 
more traditional writing on this subject; but for all that, Skinner's breadth and 
literature reduces the force of this last 


genius in culling so much material from 1 rce 
criticism. At first the book may strike the reader as being so fresh in its approach 


as to be difficult. 1f, however, he takes good care to grasp the essential behaviourist 
model as set out in the first few chapters of the book and as outlined in the earlier 
at much of the apparently more elaborate material 


part of this review, he will find th 1 
falls easily into place. The work as a whole is an original and stimulating contribu- 


tion to the study of language. E. A. PEEL. 


THORNDIKE, R. L., and Hacen, E. (1959). Ten-ihousand Careers. New York : 
London : Chapman and Hall; pp. x+346, 68s. 


John Wiley. ; 

ational psychologists seldom have the opportunity to compare the mental 
Vocatio: psy ppor y P a 
tests they use with the subsequent success of a sufficiently large group employed in 
any one occupation. However, Professor Thorndike has followed up some 10,000 
‘Air Force recruits who had taken a battery of twenty varied tests twelve years 

reviously, and has ontained IUE on their present occupations together with 
certain Y "i Ene tee 5 bs EUM information on over a hundred occupa- 
tions which ha orty or more men each, and analyses the 
one to each of these not only of test scores but also of answers to a biographical 

ata blank. z 

It emerges that the tests and biographical items gi i icti 
of any of the (admittedly unreliable) ea of mi Eis de er iat 
is in line wie a PATA There were, however, distinctive patterns of 
avanit E iem E apia = amog those engaged in different occupations, 
EN pre ie out on ee regi d qm E dues qup. eee mne 
occupational level. The main ' Sub dispo. hus "onn Eee T A 
mathematical-scientific vs. non-mathematical tendencies (e.g. oec and 
mechanical engineer vs. hotel manager) The subsidi imensi 
obscure though they seem (with a little en to eed Mog ees ave) med 
by the reviewer elsewhere: Gregarious vs. Isolated and Verbal | pec Hagae 
The vocational counsellor will, of course, wish to stud es m ae 
and background information in detail, and the book is PA P a ia ane 
He silliness get a great deal of definitive guidance from it, but at least it ien 
on ^ indications where previously there was nothing but unverified 
P. E. VERNON. 


TorGERSON, WARREN S. (1958). Theory and Methods of Scaling. $ 
Wiley and Sons ; London : Chapman and Hall : e ro. id Sky 


Prepared at the request of the Committee on Scalin 

É k the aling Theory and Methods of th 
Social Science Research Council, in the U.S.A., this book uidere a ve use 
hensive and competent synthesis of material previously only available in isolated 
articles or special chapters of books devoted to the subject matter of a particular 
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field. As the author phrases it: ''In all too many cases there has been too little 
communication between and within the various disciplines in developing, naming 
and using the various techniques (of scaling). In some cases the same techniques 
are used in different areas under different names." The book grapples with the 
difficult problem of classifying such methods and goes far in the development of a 
frame of reference, as well as giving an extensive bibliography of about six hundred 
items of the type referred to in the first sentence. The price alone makes it unsuitable 
for recommendation as a text-book, but as a work of reference or item for the library 
of the specialist in problems of measurement it is to be highly recommended as a 
scholarly and thorough piece of work. 

The book begins with a philosophical discussion of the rationale of measure- 
ment and scaling, and in the introductory chapters the author develops definitions 
of three classifications within which the methods of scaling of psychological attributes 
may be deduced to fall, namely, the “ subject centred approach," the “ stimulus 
centred approach," and the '' response approach." He makes the point that they 
fall in a common category, not necessarily outside the field of most scaling in the 
field of natural science insofar as he rejects scales which depend upon " measurement 
by definition or fiat," in which an assumption of relationship between certain 
observations and the concept under consideration is required. The description of 
recent work requires eleven sub-divisions within these three categories, and he 
presents a chapter on each. The last three of these concern the ‘‘ response approach ” 
which incorporates and is mainly concerned with the scaling methods attributed to 
Guttman; and the account of recent developments in the field of latent structure 
analysis, including some of Dr. Torgerson's own work in the field of multi- 
dimensional scaling, is likely to be particularly useful in bringing the research 
worker up-to-date in this difficult branch. It is of interest that the author regards 
latent structure analysis as ‘‘a very general way of conceptualising the measure- 
ment problems encountered in the assessment of attitudes, abilities and personality 
traits.” D. M. EDWARDS PENFOLD. 
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INTERACTION OF HEREDITY AND ENVIRONMENT IN 
REGARD TO ‘ MEASURED INTELLIGENCE ' 


D. H. STOTT 
(Department of Psychology, University of Glasgow.) 


Summary. Recent work in experimental biology suggests interaction in the 
r form of the facilitation of genetic pre-disposition by adverse conditions of 
This may explain not only how certain environmental factors may 
gpa physical malformation in human beings, but also 

ings of the Scottish Mental Survey. 

The greater resemblance in ‘ measured intelligence ’ of monozygotic twins 
can no longer confidently be attributed to ‘ heredity.’ 

Interaction of this type presupposes a very great number of alternative 
modes of development. Congenital variations in temperament and motivation, 
by affecting the acquisition of mental skills, add further alternatives. 

The ‘ contributions ' of heredity and environment can, consequently, be 
the subject only of retrospective estimate in individual cases ; general quantita- 


tive estimates for a population are meaningless. 


gestation. 
produce mental defect. 
certain more general fin 


I.—INTRODUCTION. 
Discussion of the relative contributions of heredity and environment to 
"e reached a point where the value of the 


* measured intelligence ' seems to hav 

dichotomy has been called in question owing to the demonstration of a certain 

degree of correlation. This correlation, it was pointed out (Maddox, 1957), is 
Jligence of the parents is associated 


due to the sociological fact that high test-inte 
with high cultural stimulation of the child. Maddox also suggested, from the 
analogy of chicks, that the manifestation of a hereditary characteristic might ` 


^ be dependent upon post-natal environment. 
The object of the present contribution is to draw attention to a body of 
comparatively recent work in experimental biology which has demonstrated 
yet another and, indeed, far closer type of interaction, further complicating the 


heredity-environment dichotomy. 


II.—CONGENITAL MALFORMATIONS IN ANIMALS AND MAN. 


It has been known for à generation that the subjection of the pregnant 
female to certain exogen: able to produce malformations in the 


ous stresses was li 3 
young. But these were held to be^ phenocopies, * resulting from an arrest of 
embryonic development Similar to that produced by a genetic anomaly 
(Goldschmidt, 1938). The exogenous factors were thought of as alternative 
to the genetic, and hence the unconvincing attempts to classify individual 
malformations as either environmentally or as genetically induced. Among 
the supposed latter sort there was a notable failure to establish a Mendelian 
pattern of inheritance. Some were attributed to dominant, some to recessive 
genes, and some to both (Warkany 1947, Penrose 1951) ; but whichever pattern 
was held to operate did not do so consistently. However, this variable * pcs 
sivity' of gene influence was a common phenomenon in genetics, and was 


* Phenocopy ' is the term used for a physical anomaly 2 ing i i i 

" Ó y ly appearing in a rack 1 strain 
without any forerunner such as would suggest some form of Aer pao ad CUN is, 
nevertheless, found. to have genetic forerunners in other strains. It is thus said to bea 
* copy ’ of an inherited anomaly. 
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accounted for by two special explanations—that of the * phenocopy ' mentioned 
above, and ' penetrance,’ the latter being essentially no more than the statement 
that an apparently gene-determined tendency achieved morphological manifest- 
ation only in a proportion of cases. These terms bear more than a trace of 
‘explanation by label.’ Moreover, the ‘ copying’ implied in the phenocopy 
not only implies an agent, but perpetuates the concept of an agent distinct from 
the gene ; while the concept of penetrating some unspecified barrier goes beyond 
methodological parsimony. Both terms have been rendered superfluous by 
evidence that genetic and environmental facors may interact to produce the 
malformations formerly attributed to the one or the other. 

A wide range of unfavourable environmental factors during pregnancy can 
produce malformation in the offspring. The type of stress applied—chemical, 
vitamin deficiency, mechanical, irradiation—is immaterial so far as the type 
of malformation is concerned ; the latter seems to depend upon the stage of 
gestation, and as mentioned below, upon dormant hereditary propensity. 
Whereas species of animals vary somewhat in their susceptibility to different 
stresses, the same general tendency is found in all. For example, vitamin 
deficiencies have been made to produce congenital malformation in the chicken, 
turkey, duck, pig, cow, rat, mouse and guinea-pig (Giroud, 1954). No verifica- 


tion by controlled experiment is, of course, possible in man, and except for 


In fact, attempts to relate malformation to physical illnesses during pregnancy 
have been surprisingly unrewarding. Recently, psychological factors have been 
adduced )Ingalls, 1947, 1956, Stott, 1957, 1958). The sharp rise in the incidence 
of congenital malformations from the beginning of the war and in the early 
post-war years in Germany are attributed by German workers in part to emo- 
tional shocks and stresses. (Gesenius 1952-2, Klotz 1952, Grebe 1953, Grebe and 
Windorfer 1953). 

Despite disagreement as to which types of maternal stress are responsible, 
all authorities are agreed that the pre-natal environment plays a large part in 
the production of human congenital anomalies. Itis hard to account otherwise, 
for example, for the enhanced risk of mongoloid defect and of other malforma- 
tions with both advancing maternal age, and also in first births (Murphy 1947 
Record 1956, MacMahon and McKeown 1953, Alwyn Smith and Record 1955) 
for the higher incidence of anencephaly (lack of cerebral cortex) among first- 
births conceived during the summer months (McKeown and Record 1951) 
or for the association between mental defect and conception during hot summers 
(Knobloch). The terms ‘ congenital’ and ‘innate’ have consequently to 
embrace pre-natal environmental influence as well as heredity, 


could produce only 9-9 per cent., but in another strain, which showed 2-7 per 
cent. hernias even with a diet rich in vitamin A, the deficiency produced 19 per 
cent. Similarly, Fraser and his colleagues (1951, 1954), inducing cleft palate in 
mice, found that the incidence varied from 4 per cent. to 100 per cent., according 
to the genotype of the mother and the foetus. Thus, the impairment would seem. 
to result from a hereditary pre-disposition which was facilitated, that is, brought 
into morphological manifestation, by environmental stress during gestation: 
Facilitation constituted a unitary theory which explained both the undoubte 


| 
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tendency of certain malformations to run in families and their erratic and also 
sporadic appearance, without recourse to special explanations such as those of 
phenocopy or penetrance. 


III.—' INTELLIGENCE ' AND PRE-NATAL ENVIRONMENT. 


This new biological concept promises to be relevant to current thought 
about ‘ intelligence.’ There is a certain amount of evidence (Pasamanick and 
Lilienfeld 1955, Knobloch and Pasamanick 1956, Stott 1957, 1958) that human 
mental impairment is related to adverse pre-natal environment. This might 
well explain why the prediction of a decline in national intelligence was con- 
troverted by the Scottish survey. The fall of I.Q. with increase in family size 
observed therein has its counterpart in the higher incidence of several mal- 
formations in later-born children of large families. Anderson, Baird and 
Thomson (1958) found a correlation between fatal malformation of the central 
nervous system and adverse social conditions, notably in large cities. If the 
neural structures upon which mental ability depends are similarly subject to 
lesser impairment, we should expect a rise in the mean ‘ measured intelligence ’ 
of a population with improvement in social conditions. In effect, mental 
impairment due to adverse conditions of pregnancy would seem to explain 
certain trends in the statistics of the Scottish Mental survey that its authors 
were at a loss to account for. When the fall in test-score with family size is 
examined closely, it is seen that this was most striking among the ‘white collar’ 
groups, and within them at that family size—four or five—at which the latest 
addition may have been unwanted. The intellectual inferiority of children of 
mothers of 21 or less is also largely confined to the first four social groups. This 
can hardly be due to the greater risks of first-births, since the trend virtually 
disappears in the manual-worker groups. It has been estimated (Wimperis, 
1960) from the Registrar General's Statistical Survey that two-thirds of 
legitimate children born to mothers of under 20, and 24 per cent. of those of 
20—24, are pre-maritally conceived ; and a significant proportion of the children 
of the mothers of 21 or under covered by the Scottish Survey would, presumably, 
be illegitimate. Itis reasonable to infer that extra-marital conception would be 
more frequent among those young mothers belonging to the higher social 
classes, and the mental stresses associated therewith would be greater among 
them for cultural reasons. That children of one-child families living in grossly 
overcrowded conditions should be mentally inferior even to those of larger 
families in similarly overcrowded conditions again suggests the effects of 
illegitimacy. There is consequently some reason for inferring that the mental 
impairment following adverse conditions of pregnancy is not limited merely to 
that of the occasional production of mental defect (the ‘ ineducables' were 
excluded from the survey), but may apply to the merely sub-average. 

The hypothesis of the facilitation of genetic tendencies by environmental 
influences would also explain the failure of mental defect in general, and above 
all of feeble-mindedness, to conform to genetic expectations (Penrose, 1949). 
‘A further important study has recently been published by Brandon (1957), who 
followed up the children of female mental defectives who had at one time been 
inmates of the Fountain Hospital, London. Ninety-nine of the 109 children still 
alive were apparently mentally normal. Only six of the ten subnormals con- 
sistently scored below 1.Q. 70, and only four below 65. This is higher than would 
be expected in a general population, but probably about the same as would be 
found in a slum or other culturally disorganized community. Some three- 
f the children in question were born out of wedlock and in circum- 


o 
quarters fford the mother a trouble-free pregnancy or the child an 


stances unlikely to a 
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optimal post-natal environment, so that itis open to conjecture what proportion 
of the 9 per cent. subnormality was due to heredity. 

In a sample of 450 mentally normal children the present writer (Stott, 
1957) found that, after stressful pregnancy, one child in three suffered serious 
non-epidemic illness before the age of 3 years (compared with one in nine after a 
reported clear pregnancy). The stresses in question were similar in type, 
although probably on the whole somewhat less acute, than those followed by 
mental subnormality and/or malformation. The risk of either of the latter 
types of impairment following a stressful pregnancy can be roughly estimated 
on the basis of the incidences quoted at between 1:30 and 1: 40. This study 
must rank only as an exploratory one, but the findings are consistent with the 
theories of facilitation, and help to explain why mental subnormality is more 
prevalent in poor neighbourhoods. These findings add infantile ill-health to the 


“continuum of reproductive casualty ’ hypothesised by Warkany, Pasamanick 
and Lilienfeld (1955), Warkany (1947) and others. 


IV.—Twins. 
Facilitation has a quite particular be 


aring upon the twin-studies which are 
currently regarded as a means of iso]. 


c I : ating the genetic component of ‘ measured 
intelligence.’ Maddox (1957) has already suggested that identical twins, even 


though separated from near birth, will probably have more similar environments 
—by way of placement with relativ. 


apply to both. Furthermore, twinni 
stress and a greatly heightened foetal risk, as instance the high peri-natal 
mortality of twins. If, therefore, pre-natal environment can exercise a general 
influence upon mental development apart from the production of definite defect, 
this will also tend to contribute to the correlation of measured intelligence in 
separated twins. 

The Scottish Survey (1953) found that t 
points lower than for all non-twins (5-5 
inferiority of boys to girls among twins 
non-twins), which the authors of the rep 


he mean I.Q. of twins was 4:83 
5 points lower for boy-twins). The 
(3:06 points compared with 1-77 for 
ort comment upon with surprise, is in 


> was probable that the deficiency was 
more pronounced in the case of the monozygous, 


It is possible in short, that the observed higher reins -— 
intelligence’ between monozygous twins gher im of measured 
and non-twins may be accounted for, at least ; with heterozygous twi 
tion. Given a pregnancy-stress, impairmen 
genetic vulnerability of the foetus to the impairment 
seem to depend upon the genetic proneness of the 
presumably upon her acquired vulnerability thereto. Thus in monozygous 
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twins, not only the genetic constitution but also—subject to the qualification 
below—the foetal environment is similar ; for heterozygous twins, the genetic 
constitution differs but the foetal environment is by-and-large similar ; and of 
course, for non-twin sibs the genetic constitution again differs, while the pre- 
natal environment (same mother, different pregnancy) will vary more than that 
for fraternal twins (same mother, same pregnancy). The above is in conformity 
with the hierarchy of correlations found for children of the above groups reared 
apart. In addition, monozygous foetuses share the same membrane, whereas 
the heterozygous do not. Hence, one would suppose that for monozygous 
twins the pre-natal environment would be somewhat more similar than for 
fraternals. It is again interesting that the Scottish survey found that like-sex 
twins, both boys and girls, were inferior to unlike-sex twins. The difference is 
slight, but if it represents an inferiority of the monozygous, as Sandon suggests 
from a similar result in his own study, it must be multiplied by two, since these 
would only constitute about half of the like-sex twins. In summary, there would 
seem to be environmental factors making twins as a whole more similar in 
mentality than non-twin sibs, and monozygous twins more similar than 
heterozygous, however soon after birth the twin pairs are separated. It seems 
thus unfortunately not to be true, as Warburton (1958) avers, that in ‘ identical ’ 
twins Nature has provided us with the means to measure the genetic component. 


V.—THE INDIVIDUAL'S ALTERNATIVES OF MENTAL DEVELOPMENT. 


The value of the hypothesis of facilitation is that it suggests a process by 
which the interaction of heredity and environment can affect mental develop- 
ment. If it may be assumed that a significant part of the genetic influence is 
of the facilitateable sort, it is possible to envisage broadly the range of alterna- 
tives of mental development open to an individual embryo. At each critical 
stage of embryonic life, there will be different, genetically determined, prone- 
nesses to impairment. | Whether any of these are triggered off, to produce the 
impairment characteristic of the genotype, depends upon the intra-uterine 
environment at that particularstage. Itis also realistic to think of impairment, 
not as a lowering of a general ' level of intelligence,’ but as damage to the neural 
structures which make mental operations of certain sorts impossible. This does 
not necessarily mean that certain tasks are made impossible, since we know that 
similar tasks are often performed by different individuals by very different 
mental processes. Hence, the alternative process upon which an impaired 
individual falls back may be just as good, or nearly so, as the primary one. On 
the other hand, impairment in the form of an inability to form general concepts, 
or—as suggested below—of intrinsic motivation, may have no substitute and 
so result in poor general mental functioning. Whether the individual can, m 
fact, fall back upon alternative processes depends upon whether the environ- 
ment in which he grows up allows him to exploit the next most suitable alterna- 
tive, or the next after that, as the case may be. The developmental potential of 
of any individual can thus be likened to the path of a traveller who runs the risk 
of having to take diversions, each of which offer a variety of obstacles ; his 
ability to surmount any one becomes important only if he is forced to take the 
path upon which it lies. If he suffers from congenital night-blindness, and gets 
drowned by falling into a deep pool his demise might be attributed mainly 
to a hereditary factor ; but if he had been killed by treading upon an unexploded 
bomb in taking a short cut despite warning notices his death could have been 
said to be due mainly to environmental factors in the form of defects in his 
education or upbringing. Tf one likes to conceive of the roles of heredity and 
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environment in terms of quantitative contributions, it is consequently seen 
that the (pseudo-) ratio heredity /environment not only varies from individual 
to individual, but according to the tum which his development has taken. The 


value of a general estimate of the ratio, when the variance must be so large, 
surely becomes meaningless. 


VI.—Motivation as A DETERMINANT OF ' INTELLIGENCE,’ 
. Nor, unfortunately, have we exhausted all the complications. ' Measured 
intelligence ' is a sample of an individual's behaviour. Being such, it depends 
not only upon the possession of an aggregate of neural structures such as make 
certain mental operations Possible, but upon the extent to which the individual 
is actually motivated to perform them. This is not simply a question of how 
the subject feels at the time towards the test and the tester. For the whole of 
his life he will have been variously motivated to think thoughts of a certain 
type; he will have acquired Particular readinesses or mental sets, and the 
-formation—the systems by which he has 
n atively upon his congenital plus environ- 
mentally conditioned motivation, TEP d d 
,, Now, when we study the behaviour of mentally subnormal individuals 
without pre-conceptions about à self-operating quantity of ‘ intelligence,’ we 
observe that Impairment of motivation, alias * emotional factors ' (Clarke and 
Clarke, 1958), is a large element in their poor performance. In some, one sus- 
pects, failure or distortion of motivation (in very unresponsive or hyperactive 
defectives) could be the only factor, In such children nutritional motivation 
o ivi m HIS nation and appropriate response, but 
mental activity vanishes when it is no longer a question of og drinking ; 
ashlao A 24,4. prefrontal lobotomy (Brazier 1¢ 0, 
Lashley 1935). Other defective children, although active, have no interest in 
i a Speech or social adaptation. A good 
or Saat iia of any group of educationally subnormal children suffer from 
unforthcoming temperament, alias under-motivation. Other things being 
E T poor concept-formation, since the child would not 
p d ; he mental operations upon which conceptualization is based. 
aget's ) terminology, there Would hea shartaes af the anticipatory 
2 re Ously existing classifications = a“ 
ion in this building J^ ral failure to appreciate the importance of motiva- 
Hon m ea building up of the mental equipment is due D the conventional 
paration of the Cognitive and conative fields 


: Q9 di. 
her it be ' unforthcomingness, dis 


ity to concent 4 al stimuli or 
other abnormality must there rate), lack of response to persona 


! ; ; fore be recko; onent of ' measured 
facilites In so far as it is Congenital, EA ameer AE that it can be 
acilitated by pregnancy-stress in the same way as can cognitive disability 


(Stott, 1959). This adds a third dim 


oer i ; ossibilites of 
an individual's mental development. ^" of alternatives to the p 


VIL—Is ‘Iyn 

At this point it may be ; 1 hat 

À well, in summarizing the argument, to see wha 
remains of the Concept of relative Contribution vil it is translated into the 
language of interaction. The contemproary view of genetic determinants is 
that they are built-in Instructions ’ for the development of the organism within 
the range of environment for which it is fitted, To ask whether given individual 
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ue to the variability of the instructions or to that of the 


variations are more d 
environment is not toask one question, but a thousand or a hundred-thousand— 


according to where we stop following the alternatives. It would, no doubt, be 
feasible, in dealing with an individual at a given stage of mental development, 
to study retrospectively which of the alternatives of interaction were in fact 
followed, and so be able to say, of that individual that heredity or environment 
played the major part. For example, it might be said of a child of an academic 
family that his mental development was greatly stimulated by his environment, 
and that if he had been born in an unacademic family he would have remained 
intellectually undistinguished ; Or of a child suffering from a congenital mental 
hat no matter how good the environment his mental development 


impairment t 
would always have remained low. But to try to assess either of the two types 
of influence as percentages, even in individual cases, would be merely rhetorical 


quantification. 1 : 
driven to the point of recognizing that we are not asking 


Moreover, we are c [T 
an answerable question. If an organism failing to solve a problem by one sort 
;« free within certain limits to utilise other skills; if the 


of mental behaviour is f 1 ; 
relevance of each of the alternative skills depends upon the type of problem ; if 


the primary mode of solution, or the calling into play of the alternative 
approaches, depends upon the strength and quality of the individual's motiva- 
tion ; if the skills themselves owe their existence in large part to the motivation 
which has instigated the skill-forming practice—then we are forced to question 

the resultant individual difierences in mental behaviour 


the value of subsuming the, a du : beh 
' intelligence. Our ability to measure specified individual 


within a concept of ; i T s 
differences does not justify us m assuming that there 1s any unitary factor 


underlying them, which can be conceived of as a quanitity. In our term 
‘intelligence ' we have in fact taken over the popular and traditional reification 
of these individual differences, which exists not in the mind of the subject, but 
in that of the observer. Theoretically speaking, the term * measured intellig- 
ence ' does not save us, for the attempted measurement of an animistic concept 
no more conjures it into reality than two wrongs logically ever make a right. 
By extension, it is also impermissible to pose queries as to the relative quantita- 
tive contributions of heredit ; and environment to mental efficiency unless we 

nd in which person, and even then the question only 


specify efficiency in what @ 
males sense in broad clinical assessment of etiology. 
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THE TEACHING OF PSYCHOLOGY IN TRAINING 
COLLEGES 


By K. G. COLLIER 
(Bede College, Durham.) 


Summary. The major function of a psychology course for student-teachers is 
to cultivate the student’s insight into the thoughts and motives of ordinary 
children. A brief account is given of various methods used at a particular 
college for this purpose. Re is is also made to the problem of ensuring 
an effective discipline of students" thinking on the academic content of the 


course. 


[.—INTRODUCTION. 
Ix 1960, a three-year course of professional education for teachers comes into 
existence in England and Wales. The prospect provides à stimulus for the 
reconsideration of purposes and methods in the teaching of psychology in this 
setting. There appears to be little previous work available on the teaching of 
al, for for student-teachers, or in particular for non-graduate 


psychology in gener r U € 
teachers. Oeser, in a stimulating article,? argues that psychology should be 
1 principles : for 


taught by methods which themselves illuminate psychological 
example, the students’ experience of working in small syndicates enables them 
iate the social facilitation of academic learning. In the clinical field 
Millar and Balint, in two perceptive papers,® advocate a close inter-weaving 0 
theoretical study and practical acquaintance with patients. A recent mono- 


graph from Keele’ discusses in an enlightened manner some problems and 
methods in the teaching of the psychology of personality development to 
teachers and social workers. Valentine's small book5 has served for many years 
as a basis for laboratory work to complement formal lectures and discussions ; 
its orientation, however, is tow 


ards a somewhat academic study of principles 
of learning. In any case a number of new 


techniques have become available 
since it was written. The above sources and some others? have been considered 
in working out the metho 


ds described below. —. 
Tt is as well to recognise, as à starting point, that the word ‘ psychology , 
has itself a great variety of connotations in t 


he educational world, some gross 
misuses of the word probably being a useful guide to the prejudices of the 
nce said to me, “ I get sick of all 


miens. A student of mine, for example, once "s 
this psychology in the classroom ; why can't you just give a boy a clout or à 
Instruction in Psychology. New 


Improving Undergraduate 


1 But see D. WoLFLE, êt al., 


York. : Macmillan, 1952. n 
20. E. OESER (1951). Psychology applied to 
uarterly, Vol. 5. à i i 

s w. M. MILLAR, M. BALINT and G. P. MEREDITH (1954). Symposium on * Method and 
Technique in the teaching of Medical Psychology.’ Brit. J. Med. Psyahohy 37; 


the teaching of psychology. Universities 


30-47. A : 

4 The problems arising from the teaching of personality development. Sociological Review 
‘Monograph No. 1 (1958). j I 

‘An Introduction to. Experimental Psychology. University 


5C. W. VALENTINE 
Tutorial Press, rd ed. ' i 
«M. L. JOHNSON, (1952). Teaching by free group discussions. Universities Quarterly, 
Vol. 6. ; 
. MACCALMAN (1953). Observations on the teaching of the principles of mental 


health to medical students. Brit. J. Med. Psychol., 26, 140-151. 
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telling off when he needs it and get on with the job?" To him psychology meant 
never giving direct orders or punishments but getting children to do what one 
wants by subtlety. I have heard teachers speak as if psychology meant on the 
one hand intelligence testing and on the other Freud's theories of sex. Such 
notions represent in my view serious misconceptions of the bearing of psychology 
on education : fundamentally it should be concerned with understanding people, 
ordinary people, the ordinary children in the classroom and the playground. 
To me, psychology for the teacher means primarily developing that immediate 
intuitive insight into how a boy or girl is thinking or feeling, without which no 
good teaching is possible. 


things. On the one hand one needs that sixth sense a good teacher acquires for 
nosing out the particular muddle or misconception a child has fallen into. On 


standardized tests of ability and attainment, inquiry into home background and 
into relationships with other children and teachers in the School, and so on. 
One needs to understand children's reactions, both superficial and deep-seated, 
individual and collective, to different types of discipline; whether—for 
example—quietness in a particular class means a co-operative attitude, or 
submerged resentment, or sheer cowed submission to the teacher's power. 
One needs to appreciate the Tesponses to different kinds of leadership. One 
needs furthermore to be able to see the long-term influences of one's relations 
with individuals and classes, and of the generallife of the school. Much of one's 
insight into all these things depends on one's self-knowledge, on one's apprecia- 
tion of one's own make-up and development, and especially on one's attitude to 
one's own weaknesses and childish failings. A teacher who always treats untidi- 
ness with severity has failed to realize that his own latent tendencies to untidiness 
are probably held in check by a somewhat authoritarian conscience or super-ego. 

A major problem for the education lecturer in a training college, then, is 
how to cultivate such insight in his students. how to open their eyes to what is 
happening in the children jn front of them. Some people think it cannot be done 
at all; one has much or little of the gift by nature and it cannot be improved 
by training. , Others think it can be taught, and that lectures and discussions 
are an effective instrument. My own opinion is that it can be improved by 
tutors but that lectures and discussions cannot in themselves do much. There 
are doubtless a great many different methods and the following is an account 


IL—MzrHops USED ron DEVELOPING STUDENTS' INSIGHT INTO CHILDREN. 
(1) Pre-entry observation. We arrange for students to Spend a fortnight 


students find the latter far more difficult than the former. Even a rather 
detailed series of questions means little, apparently, to an adult of 20 who has 
no standard by which to judge—for example—whether children’s groups are 
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* stable ' or ‘ unstable, or how far a child is ‘ dependent’ on the approval of 


adults. 

. (2) Films. We use a variety of films, from the Ministry's special training 
college films to a Russian account of conditioned reflexes and Slavson's record of 
the group-treatment of maladjusted ten-year-old boys." Such films provide 


excellent concrete experience to be shared by tutors and students; they need 
d up with discussions in small 


to be prefaced by an introduction and to be followe 
groups. Some of the most profitable films we have come accross are à couple of 
U.P.A. cartoons. The length is only ten minutes, the drawing and scripts are 


delightfuly witty, and psychological principles are faithfully illustrated. One 
can easily repeat the showing after discussion. 


(3) The tape-recorder is a useful instrument. One can record children 


talking together and discuss the interpretation of the conversation. One can 

select fragments from recorded lessons to illustrate various methods of exercising 

authority and play them back in a lecture. Onecan record a student’s lessonand 

discuss both his strong and his weak points and their origin and significance for 

him asateacher. This is best done in a tutorial group where the man’s colleagues 

can help him—in the right atmosphere—to look at himself more candidly. I 
as yet unexplored uses of the tape. 


am sure there are many 

4) Literature. One can discuss selected films, novels, plays, biographies 
etc., with a group all members of which have shared the experience. We have 
used such films as The Bridge on the River Kwai, The Third Man and The Grapes 
of Wrath ; and such novels as David Copperfield and Sons and Lovers. These all 
provide excellent opportunity for discussion of the springs of human motive. 


(B) Self-Study. We also value highly, as a means of encouraging students 
to embark on the long job of understanding their own make-up, the setting of an 
essay on their own development immediately after the set lectures and discus- 
sions on the development of personality. We ask specifically for a report of 
500-1000 words on some of the individual student’s own chief attitudes and 

as acquired through identifying 


values, referring particularly to those he hi 
himself with a loved h he has been drilled by someone m 


dult, and those in whicl 
authority. We warn them, of course, that we have no wish to pry into private 
understanding of life and people. 


affairs. Many discover in this exercise a new 
torial system one could get more from these self-studies 


With an adequate tu 
than we usually do. 
(6) Learning activities. The great majority of our students have been 
through years of dry academic instruction ; they have very little experience of 
They therefore have the greatest difficulty 


any method of study put ‘ swotting.’ 
in grasping what one is talking about when one says that historical episodes can 


be made more vivid to children by dramatization, and yield more insight ; or 
that concepts are useless unless they have an adequate background of concrete 
re, give them a series of miniature learning activities. 


Qr We di We, therefo 
a e divide a discussion group into four sub-groups and give them i 
of unfamiliar history to learn by four different fou ue; dd ce 


race-games ; and swotting. We test them for factual knowledge a few weeks 


able from Geneva via British Committee for Int 

e erchang 
strators, 26, Bedford Square, London, W.C.1. m 
49-150 Wardour 


7 Activity Group Therapy, obtain: 
of Social Workers and Admini: 


3 Family Circus and Willie the Kid, available from Wigm hne 
Street, London, W.1. igmore: Bims, d 
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later and show that there are negligible differences. Readers will recognize 
this scheme as based on Northway.? (b) We give them a series of ten-minute 
studies to do in pairs; learning an Ethiopian vocabulary by means of race- 
games; making comparative observations on specimens of twigs ; classifying 
three dozen pictures post-cards into six categories ; drawing conclusions from a 
series of historical or geographical pictures; finding answers to questions on a 
play or a chapter of the Bible. (c) We require all students to make a model 
from scrap materials, on some academic topic and to state what academic matter 
a child should learn through the process of making the model. Thus, they can 
begin to appreciate the great variety of approaches to study and learning, and 
the point of one's general statements on the psychology of learning. 


(7) Role-playing. We use role-playing, or the acting out of general 
principles in terms of dramatized episodes, for pedagogical principles rather than 
psychological, but the two overlap. For example, when I deal with principles 
of authority in schools, I suggest that if a teacher wishes to establish his pupils’ 
confidence in him he must show that he really cares about individuals in trouble. 
A small group of men then perform a playlet. In the first scene a conventional 
tvpe of teacher exhorts a troublesome boy to pull himself together or he will 
get drastic treatment. The second scene is of the boy arriving home for tea and 
finding nobody at home but an older brother, who is just going out. Dad is on 
overtime, Mum has just got a part-time job as a barmaid, there is no tea and no 
fire, and when the older brother goes out the boy exclaims in bitter despair, 
“ Nobody cares a damn for me." Besides the pedagogical question of the 
teacher’s treatment of the boy, there is the psychological question whether it is 
generally true to say that a troublesome boy is reacting to bad relationships at 
home. Here such evidence as that of Bowlby and the Gluecks becomes relevant. 
In our experience, the role-playing approach makes a sharper impact on the 
student’s imagination than a general discussion. 


(8) Stwdy-practices. By far the most valuable method we have worked 
with is a system of group practices which we call study-practices, since they are 
essentially an aid to study; the study of principles of psychology and of teaching 
method. Under this system every student in his second year works every week 
for a double or triple period as one of a group of a dozen or so students under a 
tutor in a class of children. I have a group doing science in a secondary modern 
2A class, and another group studying film and radio with a 3B secondary modern 
class ; we have groups taking secondary modern Geography, English and Mathe- 
matics ; and groups in junior schools doing Reading and Number Work, Art, 
Music and Nature Study. There is some class teaching but the greater part of 
the work is carried out in groups of two men with four or five children, There 
are weekly tutorials in the college, in which the next programme is planned 
and results discussed. Men get to know their own special pupils well and, by 
arrangement with the Head and the parents, visit their homes. In this work 
we find the ideal opportunity for relating concrete experience and problems to 
general principles: for example, the characteristics of individual children in 
relation to those of their age-group; the relation of a child’s personality to his 
home; the responses of children to different teaching methods; the adminis- 
tration and interpretation of standardized tests. Students write a report at 
the end of the year which is the major written work of the second year and 


? M. L. Nortnway (1940). The concept of the ‘schema.’ Brit. J. Psychol., 30, 316-325 ; 
31, 22-36. : 
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includes two child studies. At present we get about twenty weeks in the year 


for study-practice ; in the three-year course it will probably be expanded.'^ 


III.—SvsTEMATIC EXPOSITION. 


I have described some of the methods by which we 
the student's insight into human motive and thought. 
systematic in character, and often to depend on the tutor's spotting the sig- 
nificance of particular incidents, thrown i 
general concepts. But we do also make a systematic exposition of theory in 
mass lectures accompanied by tutorial discussions. We take the usual topics : 
the stages of development of children ; the psychology of learning ; the nature 
of intelligence ; the use of psychological tests: the development of personality, 
including especially the acquisition of attitudes and values; the psychology 
of authority and leadership ; and that of expressive behaviour, such as drama 
and painting. We shall certainly maintain these in the three-year course. We 
consider that the element of systematic exposition is important and we are, 
think, generally scrupulous in presenting balanced evidence, with statistical 
corroboration where possible. But we do not give so much time to these aspects, 
and we refer to scientific method and conceptual definitions in passing rather 
than in separate sections of the course. In other words we regard our work as à 
technology rather than as a pure science. Incidentally, we pay considerable 
attention to our techniques of examination and to elucidating for the students 
the criteria by which we mark the scripts. We stress the weight given to quali- 
ties of insight, judgment and perspective, and to evidence of some interaction 
between abstract thinking and personal experience in the student's mind, and 
we are experimenting with questions which will encourage reflection on experi 
ence rather than memorization of second-hand ideas. 

Our time allocation varies during the two years. In the first year a student 
has two formal lectures (to a whole year of 200 men) and two discussion periods 
a week for Education, about a third of the total being given to practical work 
such as I have described. In the second year Education course a student gets 
one year-lecture and one discussion linked to it, together with a double period of 
Study Practice and an associated discussion. A little under half of the second 


year work is practical. 


IV—SoME DIFFICULTIES. 


It will no doubt be remarked that the success of most of this work depends 
on the size of the tutorial groups. In a recent investigation it was found that in 
approximately half the colleges studied the ordinary working groups for 
Education were between twenty-five and forty in size. When I compare these 
numbers with those in academic subjects where tutorial groups in the same 
colleges may contain perhaps ten—t 
the most important single need for the teaching of psychology in the three-year 
course is for a reduction in the size of the normal working groups in Education 


departments to the standard of the aca! 
enable one to set more written work for tutorial discussion. 


10 See L. W. DOWNES, Bulletin of Education, May, 1953. 
K. G. COLLIER, Education for Teaching, May, 1955. 
J-J: FIGUEROA, Post-graduate teacher education ; some experiments in the U.S., 
Universities Quarterly, Nov. 1958. 
. VERNON, Post-graduate training of teachers in psychology- Brit. J. Educ. 
Psychol., 1950, 20, 149-152. 
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A second difficulty is getting students to read effectively. An incentive in 
the shape of a degree or diploma would probably stimulate more reading: the 
number of books borrowed from the college library always rises before an 
examination. But it is the quality of reading that troubles us: we find little 
evidence of the student making any strenuous or imaginative effort to inter- 
weave his personal experience and his book-reading, except in the context of the 
Study-Practice. It may be that this kind of reading only begins when a student 
gets ‘ bitten ' with a subject and that the real problem is to provide conditions 
in which this is likely to take place. Even then he probably needs training, 
particularly in view of the scarcity of books written at his level. 

A third difficulty in the way of the teaching of psychology is an ethical one. 
The successful understanding of a problem in physics or geology depends 
largely on high ability and intense interest. One analyses the phenomenon and 
speculates on possible interpretations ; one then checks one's hypothesis. One 
can do something similar in diagnosing the causes of a child's backwardness in 
reading. But when it comes to dealing with problems of behaviour and discipline 
one is on different ground. A 14-year-old boy, we may imagine, is off-hand in his 
manner : one teacher sees in this a scarcely disguised impertinence ; another a 
defensiveness springing from uncertainty of his own status ; a third a laudable 
assertion of his independence. And the difficulty of checking the hypothesis 
derives not merely from the absence of sufficiently delicate tests but, from the 
fact that the teacher's personal, deeply-rooted values are involved. In the first 
teacher the interpretation may be due to the suppression of his own rebellious- 
ness by a punitive conscience ; in other words he has learned to set a compulsive 
value on prompt deference to authority, The second for some reason has come 
to attach a greater importance to the under-dog's point of view ; the third 
attaches a positive value to an individual's self-assertion irrespective of its 
cause. When we are engaged in cultivating the student's insight into people, 
we are unavoidably involved in the possibility of altering his personal values and 
outlook on life. That is a serious undertaking and a considerable responsibility. 
It demands rather a high degree of personal integrity and detachment. Perhaps 
after all it is ' safer’ to teach Psychology through formal lectures and discus- 
sions which cannot get under anyone's skin, 


(Manuscript received December 4th, 1958 ). 


A STUDY OF THE CONSERVATION OF SUBSTANCE IN 
THE JUNIOR SCHOOL CHILD 


By K. LOVELL anv E. OGILVIE 
(University of Leeds Institute of Education.) 


Summary. Almost all the children in a Junior school were tested individually, 
along the lines initiated by Piaget, in an effort to trace the development of the 
invariance of substance, and to establish the arguments used by 
hree stages in the development of the 
ed, and some children give reasons 


concept of 
children to justify their answers. The t 


concept given by Piaget have been confirm 
for conservation similar to those proposed by him. On the other hand, our 
evidence does not always agree with that of Piaget, nor does it enable us to 
prove or disprove the assumption that the child arrives at the concept of 


conservation because he is able to argue logically in concrete situations. 


I.—INTRODUCTION. 
or invariance of substance is an important one. 


A quantity such as a lump of plasticine, a collection of beads, a length or a 
volume can be used by the mind only if it remains permanent in amount and 
independent of the re-arrangement of its individual parts. This notion of 
invariance is indeed essential to any kind of measurement in the physical world. 
Piaget, using many ingenious experiments, set out to determine if the concept is 
there in the child's mind from the start or whether it is gradually built up. 

It is now becoming generally known that, in Piaget's view, a child's thinking 
is largely dependent upon perception from 4-7 years of age. During this period 
thinking tends to be determined by the child ' centring ' on one aspect, dimen- 
sion or element of the situation with other aspects ignored. But, from 7-8 years 
of age he is able to break away from the influence of perception and is increas- 
ingly able to apply logical thought to practical problems and concrete situations. 


Piaget maintains that the concepts that figure in logical thought result from the 
co-ordination of actions in which the child combines, dissociates, orders and 
sets up correspondences. The child gradually appreciates the significance of his 


actions so that they acquire the form of reversible operations in the mind and 
thus render his thinking less dependent upon perception. J 

From his many experiments involving the conservation of continuous and 
discontinuous quanities, Piaget maintained that children passed through three 
stages; namely, that of non-conservation, transition and conservation. In a 
typical experiment, which we have repeated, Piaget or his students would show 
a child two balls of modelling clay of the same size, one of which is then shaped 
tolooklikea sausage. At the first stage the child will deny that the amounts of 
clay in the ' sausage ' and ball are now the same. A response such as “ Its more 
because the sausage is longer,” might well be found. At the transition stage, he 
will arrive at the idea of conservation under some conditions, or at one moment 
but will lose the idea again under slightly changed conditions ; while from 7 $ 
years onwards it is said that the child feels the logical necessity for conservation 
and will support it by argument. For example, Piaget (1950, p. 140 and 1953, 
p. 16) maintains that the child will say that the sausage can be returned to the 
a ball, or what has been lost in one direction has been gained in another, 
othing has been added and nothing has been taken away. 


THE concept of conservation 
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However, we cannot say how many children were examined by Piaget and 
his students in this type of experiment, and it was decided to test a group of 
English children under somewhat more controlled conditions. 


II.—EXPERIMENTAL PROCEDURE. 


This study reports one main and two subsidiary experiments, the latter 
being of use in clarifying certain points. In the main experiment almost every 
boy and girl in a junior school in a North of England town was tested individually 
in an effort to trace the development of the concept of invariance of substance, 
and to establish the arguments children use to justify their answers. Balls of 
plasticine were used of diameter about two inches. Providing the child agreed 
that the amounts of plasticine in the balls were the same to begin with, the 
subject was considered to be suitable for the experiment. One ball was then 
rolled out to make a ' sausage ' and at the same time the child was questioned. 
All the experiments took place in the Headmaster's room so that no child wit- 
nessed the cross examination of another, and the children's replies were recorded 
verbatim. The procedure which was followed is now given, although it must be 
stressed that many supplementary questions were asked, and our outline 
indicates the general form of the procedure and not its exact details. To get the 
child to give his reasons, we found it necessary to make our approach a flexible 
one. 

APPARATUS.—Six balls of plasticine, two equal in size, the others clearly 
different, and all six of different colour. 

TrcHNIQUE.—Ask the child to choose two balls of plasticine which are the 
same. Say—Here we have six balls of plasticine. Can you choose the two that 
are the same ? 5 

If the child replies that they are all different say—True, in colour they are 
different but I’m thinking of size, of amounts of plasticine. Let us forget the 
colours ; now, which two are equal ? 

Child having made the correct choice, say—Now you have one ball and let 
me have the other. Will that be fair ? 

(1) Will I have as much plasticine as you? The reply must be Yes before 
the experiment continues. Equality having been admitted, say—I am now 
going to roll your ball into a sausage.* Watch me. Action performed. So now 
you have a sausage and I still have a ball. 

(2) Who has the most} plasticine now ? 

(3) Why do you think so? What makes you so sure? 

(3) (a) If the sausage is longer why doesn't it have more plasticine in it? 
Or, How can it be longer and still the same? In the case of ‘don’t know’ replies to 
(3) (a) the child was asked: What happens as I roll the ball? Invariably the 
reply was It gets longer. The child was then asked Anything else? and, if 
necessary, Does it get fatter ? 

This was the end of the experiment for those clearly conserving. For non- 
- vas say—Watch carefully —I'm rolling your sausage longer still . . . longer 
still. 

(4) Now who has most plasticine ? 

(5) Why do you think that? 

(6) Very well, watch carefully. Let's roll it some more . . . some more. Now 
who has most plasticine ? 


* Sometimes the child did the rolling but more frequently it was done by the experi- 


menter. 
+ We have used the word ' most’ instead of the words ‘the more.’ Although gram- 
matically incorrect, we think that the children better understood our question using ‘ most." 


Ls 
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(T) Why do vou think that? Point out a second aspect of the dimensions— 
length or breadth—which the child is ignoring by asking the following kind of 
question. But my piece of plasticine is fatter. Does this mean that I have more 
or less plasticine than vou ? Pursue this point with similar type questions. The 
primary aim here is to force the child to consider two dimensions simultaneously 
and see what changes in thought occur—if any. 

(8) Let's roll your sausage back into a ball. Action performed. Who 
has most plasticine now ? 

(9) Who will have most plasticine if we roll vour ball out again into a 
sausage ? 

(10) Now watch carefully... I am rolling my ball into a sausage. Action 
performed at same time. Who has most plasticine now ? 
(11) Why do you think so? What makes vou so sure? 

The first of the subsidiary experiments was carried out to see if an under- 
standing of addition and subtraction of small pieces of plasticine inevitably 
produces the concept of conservation. If this is so, then any child who can see 
that adding / subtracting a piece of plasticine makes a ball more / less, should 
maintain his conclusion without regard to subsequent spatial transformation. 
Thus, a piece of plasticine was removed from a ball and the child was then asked 
what he thought about the amount of plasticine in the ball. The piece removed 
was left in the clear view of the child. The experimental ball was then subjected 
to a spatial transformation and the child questioned again about conservation 
of plasticine. The following numbers of children were involved : thirty-five at 
the stage of conservation as found from the main experiment, forty-three at the 
stage of transition, and thirty-eight at the stage of non-conservation, making 
one-hundred-and-sixteen pupils in all. Some of the children were subjected to 
this test at the time of the main experiment. . 

The second subsidiary experiment was designed to test the hypothesis that 
a child at the stage of non-conservation in the case of a plasticine ball, is not 
inevitably a non-conserver in another situation involving quantity; in this 
case a non-conserver of rubber in a rubber band. Thus, a rubber band was 
shown to the child ; it was then stretched, and he was questioned as to whether 
or not there was the same amount of rubber in the band as before. Forty-eight 
children all at the stage of non-conservation in the main experiment undertook 


this test. 


IIL—STATEMENT OF THE RESULTS. 

Main EXPERIMENT. ; 
The three stages in the development of the concept of conservation of 
substance have been found. Table 1 shows the number of children at each stage 
in each of the year groups. It was difficult at times to classify correctly a child 


at the transition stage. 


TABLE 1 
NUMBERS OF CHILDREN IN Each GROUP AT VARIOUS STAGES. 


Non- 


Average Age | Conservati ope — : 
Yr. Mo. Conservation| Transition Chnsercation Total 
Ist year ....| 7 8 30 27 z = re 
2nd vear .. 8 10 44 8 65 
Zid year «44 9 9 73 15 | 99 
4th year .... 10 8 64 7) 75 
211 57 54 322 
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Table 2 shows the percentage number of children at the conservation stage 
giving various reasons for conserving. Some children gave, spontaneously, 
more than one reason ; and in order to make the meaning of the words used in 
the table as clear as possible, definitions are first given. 

Reversibility —It is difficult to be sure exactly what Piaget means by 
‘ reversibilité.’ Bunt (1950, p. 28) thinks he means that the child must be able 
to picture to himself the ‘sausage’ returned to the original ball. We were 
unable to keep to this exact meaning for reversibility. Our criterion is that the 
child gave evidence of his awareness of the equality of the balls at the beginning. 
The child says in effect, “ They were the same before." Of the conservers only 
five children spoke of rolling the sausage back to a ball which would be identical 
to the other ball, or pointed out that after rolling back it would be as it was 
before being rolled out.* 

Identical action.—Child says, in effect, that the ball held by the experi- 
menter can be rolled out into a similar sausage. 

Plus/Minus.—Child says, in effect, that nothing has been ‘ added to’ or 
' taken from ' when the ball was rolled into a sausage. 

Co-ordination of relations.—Child says, in effect, that although the sausage 
is longer it is thinner. Bunt (op. cit.) criticises Piaget for the use of the term 

‘ multiplication of relations,’ and we have kept to the term ‘ Co-ordination of 
relations in compensatory fashion.’ It appears that, for Piaget, the word 
‘multiplication’ means what is sometimes called ‘logical multiplication,’ 
i.e., two factors simultaneously affecting a third. In this case it is height and 
cross section jointly affecting volume and he seems to imply something that can 
be expressed in units. We are not able to keep to Piaget's meaning at this 
point, if we interpret him correctly. 

Weight.—Child refers to the fact that the weights of the sausage and ball are 
the same. The assumption is that the weights of the two balls were the same 
originally. j 

Shape.— Child says, in effect, that the lump of plasticine has only undergone 
a change of shape. 


TABLE 2 


PERCENTAGE OF CHILDREN IN EACH YEAR Group GIVING VARIOUS ANSWERS FOR 
CONSERVATION. 


] r Co-ord- 
Reversi- Identical Plus/ ination Don’t 
bility Action Minus of re- Weight Shape Know 
lations 


(30 
14 


We now analyse the replies of the children at the stages of transition and 
non-conservation. Table 3 shows the number of children in each year group who 
gave clear evidence of ' centring ' on one dimension. The criterion was that they 
replied, It's more because its longer ” or some essential variation of this at 
some point. It must be stressed that others might well have ' centred ' on one 
aspect but we have no clear evidence of it. 


* [t will be seen later that some children at the other two stages spoke of rolling the 
sausage back to a ball which would be bigger/smaller than it was originally. 
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TABLE 3 


NUMBER OF CHILDREN IN EACH YEAR GROUP AT 
SrAGES WHO GAVE EVIDENCE OF * CE 


‘TRANSITION AND NON-CONSERVATION 
NTRING' ON ONE DIMENSION, AND OF A CHANGE OF 
' CENTRING.” 


Centring Change of Centring 
s : 


Non-Conserva- Non-conserva- 
Transition tion ‘Transition tion 


Ist year 2 
2nd year .... 
3rd year 
4th year 


If a child judges by look alone, the aspect of the total situation on which 
he is ‘ centring ' may change. The sausage is bigger one moment and the ball is 
bigger the next. There is no reason why the child should not switch his 
‘centring,’ and if he is not conserving then his opinion might change. The 
criterion was that the child said that a particular object was, say, “more * 
at first and later ' less,’ without any prompting or forcing by the experimenter ;* 
that is before stage 7 in the general procedure. Again, other children might 
have changed their ‘centring’ from one aspect to another but we have no 
evidence of their so doing. Table 3 also shows the number of children at the 
transition and non-conservation stages who changed, without any prompting or 
forcing, the aspect of the situation on which they ' centred." 

At stage 7 in the testing procedure the child was forced to observe two 
Itaneously. Table 4 shows the number of children in each year 


dimensions simu ; L l r 
group who give clear evidence of being able to consider two dimensions together. 
iterion was that, in essence, the child made reference to two dimensions 


The cri 
in compensatory fashion, e.g., Its longer put its thinner." Table 4 also shows 
how the children reacted. Those at the transition stage are sub-divided into 


those who conserved to the end of the experiment and those who conserved for a 
while but reverted to non-conservation before the end. 


TABLE 4 
JACH YEAR GROUP AT TRANSITION AND Non-CONSERVATION 


NUMBER OF CHILDREN IN E: 
SrAcEs WHo GIV EVIDENCE OF ATTENDING TO Two DIMENSIONS. 


= 


TRANSITION Non-CONSERVATION 


Can consider two | Can consider two | Can consider two 

dimensions and con- dimensions and tem-| dimensions but fail 

serve at end of ex- | porarily conserve to conserve 
periment 


Ist year 
2nd year 
3rd year 
4th year 


* [t must be realised that the fact that the child undergoes the experiment at all, does 
to some extent force him to think. D S 
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The study of reversibility at the transition and non-conservation stages 
is of great interest and importance. Table 5 shows the number of children in 
each age group who gave clear evidence of reversibility as we have defined it. 
It will be noted that reversibility does not necessarily produce conservation in 
spite of the fact that it is given as a reason for conservation among conservers. 


TABLE 5 


NUMBER OF CHILDREN IN EACH AGE GROUP AT TRANSITION AND Non-CONSERVATION 
STAGES WHo Gave EVIDENCE oF REVERSIBILITY. 


Transition Non-Conservation 


Ist year 5 
2nd year ........ 10 


3rd year LN 9 
3 


Thus, seventy children can show reversibility as we have defined it, but 
draw no conclusion from it and remain at the non-conservation or transition 
stages. 

In addition, two children at the transition stage and four at the non- 
conservation stage showed clear awareness of the possibility of returning the 


sausage to a ball, but claimed that the ball would be bigger/smaller than the 
original. 


SUBSIDIARY EXPERIMENT 1. 


A small piece of plasticine was taken from the ball. All the children tested, 
except two, agreed correctly that there was then less plasticine in the ball. 
The plasticine was then subjected to a Spatial change. The thirty-five children 
who were conservers in the main experiment remain conservers, while Table 6 
shows the effect of the experiment on children at the transition and non- 
conservation stages. 


TABLE 6 


NUMBERS OF CHILDREN ATTAINING CONSERVATION AFTER THE SUBTRACTION OF A PIECE 
. OF PLASTICINE. 


TRANSITION Non-Conservation 


. Conserved or " Conserved or 
Examined almost so Examined almost so 


Ist year T X 17 10 16 1 0* 


2nd year ... a 5 10 
3rd year " . 14 
4th year — 7 


* Two pupils seemed to move some way towards conservation. 
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SUBSIDIARY EXPERIMENT 2. 

In this experiment we tested to see if non-conservers in the main experi- 
ments were also non-conservers in another situation involving invariance of 
quantity—in this case, of rubber in a stretched rubber band. The results are 


given in Table. 7. 
TABLE 7 


NuMBER OF NoN-CONSERVERS (MAIN EXPERIMENT) IN EACH AGE GROUP AT VARIOUS 
SrAGES WHEN CONSIDERING THE QUANTITY OF RUBBER IN A RUBBER BAND. 


Number 
Conservers Undecide ———Ü 
tested ecided Non-con: ers 


Ist year 
2nd year ... 
3rd year 
4th year 


26 19 
11 7 
7 $ 2 
4 0 


48 28 


TV.—DISCUSSION OF RESULTS. 


Strong evidence has been produced in support of the three stages proposed 
by Piaget, and in our view, he was justified in trying to trace the development 
of the concept of invariance of substance. But the stages are not clear cut ; 
the borders between them are zones not lines. Further, we cannot be sure that 
the group of operations proposed by Piaget of reversibility, combination of 
compensated relations, and identity, are, in fact, sufficient, although they may 
well be necessary before the child can attain conservation. 

In the case of non-conservers, there is clear evidence that thinking is 
frequently dominated by perceptual ‘ centrings,' usually on single dimensions or 
equalities. Table 3 shows that forty out of fifty-four children at this stage 
replied to the effect that “its more because its longer” or some essential 
variation of this. The remaining fourteen either could not give a reason or 
advanced all manner of apparently irrelevant arguments such as, " mine's a 
sausage "; " you've rolled it." Furthermore, Table 3 shows that twenty-four 
non-conservers gave clear evidence of a change of ' centring ' during the experi- 
ment and it is possible that this double“ centring ' may influence the attainment 
of conservation in some instances. 

Piaget has unequivocally stated (1950, p. 140 and elsewhere) that the 
co-ordination of relations (e.g., height makes up for what is lost in length) is 
necessary before conservation comes about. From Table 4 we see that six 
non-conservers can consider two dimensions in compensatory fashion when 
forced to do so, but fail to conserve. Of the children at the transition stage 
seven conserve temporarily when forced to consider two dimensions, while a 
further four conserved at the end of the experiment. , 

Again, Piaget has stated (1950, p. 141) that the child at the non-conserva- 
tion and transition stages had, on occasion, admitted a return to the starting 

oint, without this ‘ empirical reversal’ constituting a complete reversibility. 
We do not understand exactly the difference between this ‘ empirical reversi- 
' and a complete reversibility, but we do know from Table 5 that some 


bilit ea Y 
children gave evidence of reversibility, as we have defined it, and yet did not 


conserve. 
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At the transition stage suggestion is strong in some children. They may 
give the answer they think is required and not their own opinion, but it was not 
found difficult to obtain, in answer to later questions, what seemed to us to be 
their own viewpoint. At this stage the child may begin the experiment by 
conserving but later resort to non-conservation ; or he may begin as a non- 
conserver, then conserve as the experiment proceeds, and finally revert to non- 
conservation again. The striking feature about children at this stage is that 
they are uncertain. The child is stumbling towards the concept of conservation 
and may revert to non-conservation at any time. 

In the final stage the equality of the ball and sausage, from the point of 
quantity, is always maintained. But one can still note the lingering influence of 
perception because some children will readily admit that the ' sausage’ (or 
ball) looks more. In the first year, 13 per cent. of the age group were unable to 
give a reason, but all the other pupils did in all the year groups, and many 
spontaneously gave more than one reason. Some say, " As they were the same 
before ” (reversibility) ; some say, " You haven't added any,” or “ You 
haven't taken any oft," (operation of plus/minus) ; others say, “ Although its 
longer its thinner ” (co-ordination of relations). 

A small number of children justify conservation on the grounds that the 
ball held by the experimenter could be changed into a sausage similar to that 
held by the testee (identical action). A few children say that their lump of 
plasticine has only undergone a change of shape, or that the weights of the ball 
and sausage remain constant. Piaget does not mention these reasons as far as 
is known to the writers. The last is particularly interesting for Piaget maintains 
that conservation of wcight comes later than conservation of substance. He 
may well be correct and experiments are being carried out to verify his views. 
But even now we suggest that substance is conserved more safely when supported 
by conservation of weight. 

In the first subsidiary experiment we have shown that witnessing and 
agreeing to the operation of subtraction of a small piece of plasticine does not in 
itself necessarily produce conservation. But it is true that the operation does 
help the child to fix his attention on quantity rather than on dimension, and it is 
this that seems to help children to conserve who were previously not doing so. 
About a quarter of the children tested who were non-conservers in the main 
experiment, now conserve or almost do so. Among those at the transition stage 
the move towards conservation is roughly doubled, but this could have been 
partially anticipated since the lines between the stages are not clear cut. It is 
the decision in the main experiment, to place all those who change their mind 
into the transition stage, whether they conserved to the end or not, that enables 
some children to become conservers after this operation with apparent ease. 

In the second subsidiary experiment we have demonstrated that children 
who are non-conservers in one situation involving quantity, are not inevitably 
non-conservers in another. Indeed, about one-third of those who were non- 
conservers in the experiment involving plasticine were conservers in the 
experiment employing the rubber band. Hyde (1959) also found that some 
children who were non-conservers in a test using plasticine balls were conservers 
when a liquid was poured from one vessel into another of different shape. For 
example, when liquids in similar vessels A and B, which certain children agreed 
were equal in amount, were poured into glasses C and D, respectively, they agreed 
that the amounts in the latter vessels were equal in spite of great perceptual 
differences due to C and D being of entirely different shapes. Yet, when the 
liquids in A and B were matched again for equality, and the liquid in B was 
poured into a number of smaller vessels which were all similar to one another, 
some of those who previously agreed to conservation, now denied it. Beard 


— 
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(1957) also found among sixty children aged 6 and 7 years, that some who were 
non-conservers when comparing balls of plasticine, could conserve when water 
was poured from one vessel to a number of smaller ones. 

Thus, it seems that children who are conservers of continuous quantities 
in one situation are not inevitably conservers in another. On this point the 
more recent findings seem to lead to conclusions at variance with those of Piaget. 
Our interpretation of his view is that once the concept of the conservation of 
substance has been attained, it holds in all situations involving conservation of 
substance. If we have understood him correctly, then our evidence, also that of 
Hyde and Beard, does not support his viewpoint, which seems to us to hide the 
particularities of child thinking. It seems rather than the concept is applicable 
only to highly specific situations at first and that it increases in depth and 
complexity with experience and maturation. Piaget's view that the child arrives 
at the concept because he is able to argue logically in concrete situátions may or 
may not be correct. A careful scrutiny of all our evidence does not enable us 
to prove or disprove his viewpoint. It is equally likely that the concept of 
conservation of substance—or indeed, any concept—grows out of the inter- 
locking of several organisations of past impressions that normally remain outside 
consciousness (schemata) which in turn grow out of many and varied experiences. 
The child may then invoke logical argument to justify the concept which was 
obtained on other grounds. Or, it may be that experience and the ability to use 
logical thought aid one another and bring about certainty. 

The following points also arise from our study : 

(1) Our evidence suggests that in the first two years of junior school there 
are many children who have not attained much concept of the invariance of 
Substance. It must not be assumed that the children of this school were entirely 
representative of British junior school children generally, and it is our opinion 
that in some other junior schools the proportion of conservers might be some- 
what higher. 

2) From work reported here and elsewhere, there is, in our view, no doubt 
that Piaget's type experiments are in themselves learning situations. 

3) Ourexperiments have shown the grave dangers involved in accepting a 
child's apparent understanding at face value. 

4) Non-conservation of substance may affect the lives of children and less 
able adults more than is generally recognised. For example, we have evidence 
that some children are prepared to pay more money for a piece of toffee when 
it is in one shape than when it is in another, 

(8) Our work has confirmed the considerable verbal confusion known to 
exist in children in the first two years of the junior school. They confuse such 
terms as longer, fatter, shorter, bigger, thicker, smaller. Here are two examples 
from first-year children : 

“Jt gets thinner and bigger, it doesn't get fat though." By bigger we 
assume he means /onger. 

“ It'll get right long and smaller and smaller." By smaller we assume he 


means thinner. 

Piaget, too, has read into the child's remarks what he thought the latter 
meant, although we cannot be sure that either Piaget or ourselves always 
judged correctly on this issue. To us this confusion supports our view that the 


idea of quantity grows slowly. 
(6) Reversibility, in the sense in which we have defined it, is the most 


frequently given reason for conservation at all ages. 
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AGGRESSIVE AND WITHDRAWING CHILDREN IN THE 
NORMAL SCHOOL 


II—DisPARITY IN ATTAINMENT 


By E. A. LUNZER 
(Department of Education, University of Manchester.) 


Summary. Inarecentarticle, Lynn put forward the view that anxious children 
tend to show a higher performance in reading as compared with arithmetic. A 
group of aggressive children and a group of withdrawing children were given 
tests of reading and arithmetic and the disparity scores compared. There is a 
strong tendency for the former to perform badly in arithmetic and Raven’s 
Matrices. These results are discussed in the light of available evidence. 


J.—DESIGN OF THE INVESTIGATION. 

In a previous paper (Lunzer, 1960), we reported an investigation designed to 
compare the adjustment of three groups of children, selected by class teachers as 
most aggressive, most withdrawing or ‘ exceptionally well-adjusted.’ 

Following on the hypothesis advanced by Lynn (1957), it appeared likely 
that the withdrawing sample would show a marked superiority in reading as 
compared with their performance in arithmetic. It seemed possible that the 
aggressive sample would reach a comparatively higher standard in arithmetic, 
although here the theoretical grounds were less certain, except insofar as 
statistical considerations alone might tend to create such a picture if the 
disparity was clear-cut for the withdrawing. In order to test these hypotheses, 
all children in the three samples aged 8 and over were given, the following tests : 


(1) Schonell’s Graded-Word Reading Test (R1). 

(2) Schonell's Silent Reading Test, Form B (R4). 

(3) Vernon's Graded Arithmetic-Mathematics Test (here abbreviated to 
A/M). 

(4) Raven's Progressive Matrices (1938). 


The groups covered by the present inquiry consisted of 30 ‘ aggressive : 
(mean CA, 10 yrs. 4 mths, s.d. 19:08 mths.), 26 ‘ withdrawing ’ (mean CA, 
_# 10 yrs. 5 mths., s.d. 16:3 mths.) and 32 ‘ well-adjusted ' children (mean CA, 
10 yrs. 5 mths, s.d. 16-9 mths). The method of selection is described more fully 
in the previous paper. The samples cannot be taken as fully representative 
(but at the same time the population from which they were drawn was not 


highly selected). 


II.—RESULTS. 

The method of scoring was determined by the fact that we were concerned 
not with the absolute standards attained by children, but rather with their 
relative attainments on one test as compared with another. It could not be 
assumed that the original standardisation samples were comparable for all four 
tests and hence it did not seem legitimate to compare attainment ages. The 
procedure finally adopted was to convert all raw scores into sigma scores, using 
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the subjects of the investigation as an ad hoc standardisation sample. Hence, 
the scores referred to below have no meaning outside the samples on which they 
were obtained, but they are comparable as between one test and another. The 
overall mean score on each of the tests is zero and the standard deviation is 1. 
Table 1 shows the mean scores together with the standard deviations of the 
three groups taken separately (expressed in terms of the overall distribution). 


TABLE 1 


COMPARATIVE ATTAINMENTS ON Four TESTS. 


A 
(N—30) 
M — -1370 
RSEN Qus R ee: g 1:0339 


M— -2670 3097 

OT EE E c 1:1267 -8486 
M— -3037 | 44-4456 

o 1-0158 7663 

M— 4223 — -0381 i 4159 

— c 1:0997 1-0669 ‘7218 


It may be seen that the well-adjusted group show a uniformly higher per- 
formance on all four tests, a result which is not unexpected. Moreover, their 
scores are less widely distributed than those of the two poorly adjusted groups. 
The latter differences are significant, as is shown by the application of Bartlett’s 
test of homogenity. 


TABLE 2 


HETEROGENEITY OF VARIANCES IN FOUR TrsrS. 


significance 


05 


10 
05 
05 


An inspection of the distribution of scores obtained by the three groups 
showed that the lower variance of the well-adjusted sample is due to the com- 
parative rarity of very low scores, rather than to the concentration of scores 
about the average of the group. In other words, some of the poorly-adjusted 
children, and none of the ' exceptionally well-adjusted ’ were very backward. 
Needless to say, the latter finding may be solely the result of teachers’ selection, 
although this is unlikely. 

Some allowance must be made for lack of homogeneity in interpreting 
analyses of variance (Lindquist, 1953). Results of these are as follows : 
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TABLE 3 


Differences between groups are not signficant for the two tests of reading. 
For the arithmetic and matrices tests the level of significance is sufficiently high 
to warrant some confidence in spite of the lack of homogeneity. An inspection 
of Table 1 shows the greatest difference to lie between the aggressive and the 
well-adjusted groups, 


in favour of the latter. 
However, these results can best be considered after an examination of 
difference scores as 


between tests : 


TABLE 4 


DISCREPANCIES IN ATTAINMENT ON Four TESTS SHOWN BY THREE GROUPS OF CHILDREN. 


A (N=30) Ww (N=26) 


M---166 — +0392 
-8280 


RI—A/M 
EE E 
————— Mas 1.1370 

4976 


+2123 
7191 


Matr.—Rl . Lea 
` + (0365 


9662 +7487 


Matr.—R4 


—— 


qug | 073 


.9373 +6384 7625 


Matr.—A/M 


t discrepancies as between the three 


ying the greates i 
The two results showing of variance. Neither of these proved significant. 


sis ; 
Soupe vue De 1-44 and for Matr.—R1 it 15 2-88. The latter is 
not far short of the “D5 level (3-13). 
]II.— DISCUSSION. 
= x the trend of these results is in direct contradic- 
"" It is interesting ix advanced earlier. It may be that the comparatively poor 
tion to the hypothesis ve children in arithmetic and in the matrices test is due 
performance of aggre S jessmiess- These children have no desire to conform to 
to their tendency to CFF) their ratings on this trait (Lunzer, 1960). With- 
adult authority, as 1$ sho he hand, are happiest at repetitive and mechanical 
drawing children, on thé d ard even in this type of task is below that of the well 


work, although their stand 
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adjusted. It is assumed in this interpretation that performance both in arith- 
metic and in the first two sections of the matrices test is heavily loaded with the 
effects of perseverance at a relatively low level. It would follow that the 
relationship found in this study should not be universal. Where the basic 
skills in reading and arithmetic have not yet been established, neither can be 
pursued in a mechanical fashion, and one cannot, therefore, generalise with 
safety as to which of these subjects will present a child with the greater trustra- 
tion. It is significant in this connection that the mean discrepancy between R1 
and A/M for eighteen aggressive children whose reading and arithmetic ages were 
both above 9-0 was 0-2317 in favour of reading, while there was little to choose 
between proficiency in reading and arithmetic for the remaining twelve children 
who had not attained a reasonable standard in both (mean discrepancy 0:0692). 
It remains possible that if the comparison is made at a much higher level, 
between mathematics and literary subjects, children of anxious disposition 
(including withdrawing children) will show specific failure in the former, not as 
a result of carelessness, but because, at such a level, the work demanded is no 
longer of a mechanical nature (Skemp, 1958). 


In view of the comments made by Reed and Schonfield (1958) on Lynn's 
interpretation of his results, the present findings are not so much at variance 
with these as might be supposed. Lynn found that children who made high 
scores on two ‘anxiety ' tests were significantly better at reading than arith- 
metic. For one of these tests no clinical validation is given, but the other is a 
test found by Himmelweit and Petrie (1951) to discriminate between neurotics 
and normal children. Since the former sample was taken from children attending 
a child guidance clinic, without further specification as to the type of maladjust- 
ment, there is little reason to believe that the trend found by Lynn should not 
be due to the presence of a preponderance of ‘ aggressive ' children among the 
' anxious ' groups. 

It will be noticed in Table 3 that ' aggressive ' children tended to perform 
particularly poorly in the matrices test. It might have been supposed that both 
groups of poorly-adjusted children would tend to be ' under-achievers ' while 
the well-adjusted would be ' over-achievers ' or working to capacity. That this 
was not so underlines the caution made by Vernon (1958) with regard to the use 
of group non-verbal tests in the diagnosis of retardation. The marked weakness 
of the ' aggressive ' group in the matrices test may have been due to carelessness, 
but this we do not know. 

It remains that the question of disparity in attainment is not a simple one. 


The following points need to be taken into account : 
(1) The combined reliabilities of two tests must affect results. 


(2) The comparative difficulty of subjects is affected not only by the 
specific experience of children (teaching skill, absences, etc.), but also by the 
standard which they have attained in each. Thus, where the mechanics O: 
reading have not been mastered, there is every reason to believe that the 
learning of reading is at least as complex a task as the learning of arithmetic. 
At a somewhat more advanced level, both skills may show a high component of 
mere mechanical repetition but performance in arithmetic is more liable to 
errors of carelessness. Finally, at the secondary level we may find that skill in 
mathematics demands a certain elasticity of approach and flexibility of 
reasoning which are impaired by blockages due to anxiety. But, whatever the 
level, individual differences will be important, since the experience of success i$ 
the all-important factor in the acquisition of skills. 
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A NATIONAL SURVEY OF THE ABILITY AND ATTAIN- 
MENT OF CHILDREN AT THREE AGE LEVELS 


By D. A. PIDGEON 
(National Foundation for Educational Research.) 


1955 some 10,000 children attending maintained or assisted 
d Wales were given specially designed tests of reading, 
arithmetic and non-verbal ability. The children tested formed separate samples 
in each of three age groups. The age groups and the dates of testing were so 
chosen that the children in the three samples were : 

(i) Commencing their junior school course (7-8 years). 


Summary. During 
schools in England an 


(ii) Ending their junior school course (101—113 years). 

(iii) In the last year of compulsory education (14-15 years). 

A systematic but unbiased method of sampling was employed which was 
considered to be effectively random. 

The aims of the survey were : 

(i) To obtain reliable estimates of attainments as a base-line for future 

comparisons. 

(ii) To determine the degree of backwardness in each age group. 

(iii) To compare the levels of attainment of boys and girls, and of children 
attending schools in urban and rural areas and of various types, €. 
mixed, single-sex, all-age, voluntary. 

While the results obtained relevant to the first two aims cannot be evaluated 
until the repeat of the survey, planned for 1960, is carried out, this paper gives 
some of the more general findngs relating to the levels of attainment achieved 
by the children in the three age groups, and to the differences in performance 
observed between children in various sub-groups. 


I.—INTRODUCTION. 


Ix 1955, in accordance with its declared policy, the National Foundation for 
Educational Research carried out a survey of the ability and attainment of 
representative samples of children in England and Wales. Before the survey 
took place its aims were stated as follows : 

i) To estimate average levels and ranges of attainment in reading and 
arithmetic for the three age groups 7+, 10+ and 14+, in order to discover 
what are the existing standards and thus to provide a basis for future com- 
parisons. 

(ii) To determine the amount of backwardness in each age group and to 
estimate how much of this backwardness is genuine retardation, due to a variety 
of factors, and how much is attributable to low general ability (as measured by 
a non-verbal test of intelligence). 

(iii) To compare the levels of attainment of boys and girls, and of children 
attending schools, in urban and rural areas, and ot various types, e.g., mixed, 
single-sex, all-age, and voluntary. 

Some results bearing on the second aim were published in 1956 (Pidgeon 
and Yates, 1956). The important issues raised by this publication and by the 
subsequent discussion on the relationship between ability and attainment that 
appeared in the correspondence pages of Education (Burt et al. 1956) are still 
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not entirely resolved. It was clear, however, that the data from the 1955 
Survey could not be used to obtain measures of ' under-functioring ’ and hence 
an operational definition of backwardness had to be adopted. This meant that 
the second aim of the survey, quoted above, had to be modified and further 
that the data on backwardness derived from the survey could become fully 
intelligible only when compared with similar data obtained by repeating the 
same tests on a subsequent occasion. 

Similarly, the results of the survey relevant to the first aim—the establish- 
ment of a base-line for future comparisons—also cannot be evaluated until the 
repeat of the survey which is planned for 1960. 

Accordingly, this article is concerned first of all with a brief description of 


the survey—the sampling procedure, the tests used and the administrative 


problems, and secondly, with such results as are relevant to the third aim, 


namely the comparisons of the performances of children in different sub-groups. 


IIL—THE SAMPLES OF CHILDREN. 


The survey was confined to grant-aided schools maintained or assisted by 
local education authorities in England and Wales. It was carried out at three 
age levels and the populations involved at each level were defined as tollows : 

7-4- Group: Children in infants, junior and all-age schools who had reached 
their seventh birthday but not their eighth by August 3lst, 1955. 

10-+ Group* : Children in junior and all-age schools who had reached their 
tenth birthday but not their eleventh by August 31st, 1954. 

14- Group : Children in all-age and secondary schools of all types who had 
reached their fourteenth birthday but not their fifteenth by August 31st, 1955. 

After careful consideration, the dates of testing were fixed tor October, 1955, 


for both the 7+ and 14-- age groups, and May, 1955, for the 10+ group. Thus 
children were tested (a) at the beginning of their junior school course (actual 
age range 7:2 to 8:1); (b) at the end of their junior school course (actual 
age range 10:9 to 11:8); and (c) at the end of the period of compulsory 


schooling (actual age range 14:2 to 15: 1). 


IIL.—THE SAMPLING PROCEDURE. 

There is not space here to give full details of the procedure employed. 
Briefly, the sampling unit was defined as the school and all children in the 
appropriate age group in each selected school were tested. The samples of 
schools were picked directly from the lists (kept at the Ministry of Education) 
of all the schools in England and Wales covering the defined populations. For 
the 7-+ and 10+ age groups, in order to ensure the adegan ee of 
schools of different types and in urban and rural areas, random sampling was 
vw pabuli h of these strata. For the 14+ age group, the 


carried out separately in eac 
administrative distinction between urban and rural areas was less important 
and hence, here, stratification was by sex of school in addition to type. The 


sampling method was thus systematic but unbiased and was considered to be 


effectively random. s Él 

Tables 1 and 2 give the number of local education authorities involved 
and the number of schools in each age group. It will be noticed that one 
English county council education authority refused to partipate in the survey. 
It was not thought, however, that this made any material difference to the 
representativeness of the results. 


* This is, of course, the age group more usually known as 11+. 
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TABLE 1 


NuMBER OF L.E.As. TAKING PART IN THE SURVEY. 
———— ————ÜÓ—— ae 


Nos. of L.E. As. with schools in No. of 
= 1 L.E.As: 
3 age 2 age 1 age with no Totals 
groups groups group schools 
English C.Cs. ... ese 15 18 12 4 49* 
English C.Bs. +--+- -5 1 8 24 46 79 
Wales  ..eeÓ 3 3 6 5 17 


a eee | 19 29 42 55 145* 


* One English C.C. refused to participate in the Survey. 


TABLE 2 


NUMBER OF L.E.As. AND SCHOOLS IN THE THREE AGE GROUPS. 


7+ Group 104- Group 144- Group 
L.E.As. Schools L.E.As. Schools LEAS Schools 
English C.Cs. 32 58 37 62 24 42 
English C. Bs. 12 14 16 17 15 17 
Wales ...-++ 5 8 10 12 6 8 
^^ fot ..| 49 80 63 | s 45 | 86 


The numbers of children tested were approximately 2,850 at 7-+, 3,200 at 
10+ and 3,650 at 14+. In the results to be given later, the actual numbers 
vary, since not all children were present for every test. 


IV.—THE TESTS GIVEN. 


The 7-- group was given a 60-item non-verbal picture test and specially 
designed tests of reading comprehension and mechanical arithmetic. The 
picture test was the N.F.E.R. Picture Intelligence Test 1 and its reliability 
calculated by Kuder-Richardson formula 20 was :93. The arithmetic test 
consisted of four separate sub-tests of 20 items each, in addition, subtraction, 
multiplication, and division, administered untimed on different occasions. The 
Kuder-Richardson reliability of the total test of 80 items was -97. The reading 
test contained 60 sentence completion items and was the same as that given to 
children in the older age groups. Unfortunately, although it proved to be 
highly successful with children of 10+ and 14+, it was found not to give 
effective discrimination, with the younger children and hence had to be omitted 
from the analyses. 

The 10+- group completed four tests; the 60-item reading comprehension 
test (K-R reliability -96) a. 64-item non-verbal test (K-R reliability -94) and 
two tests of arithmetic. The first of these (Mechanical Arithmetic 1, K-R 
reliability -92) was a 25-item mechanical test ranging from simple addition to 
multiplication and division of fractions and decimals, given without time 
limits. The second (Problem Arithmetic II, K-R reliability -98) consisted of 
100 items, 40 mechanical and 60 problem, administered with a time limit of 
50 minutes. This test was, m fact, similar to the usual type of arithmetic. test 


D. A. PIDGEON 127 


used by many local education authorities as part of their 11+ allocation 
procedures. 

The 14+ group completed the 60-item reading comprehension test (K-R 
reliability, with this age group, -96) and the 25-item untimed mechanical 
arithmetic test given to the 104- age group (K-R reliability with this age group 
-97). In addition, they were given a 96-item non-verbal test (K-R reliability 
-96) and an 85-item timed problem arithmetic test (Problem Arithmetic ITI, 
K-R reliability -97) which included items common to all secondary school 
curricula. 

The tests in all three age groups were given on the mornings only of three 
consecutive days. 


V.—ADMINISTRATIVE ARRANGEMENTS. 


The administration of a survey of this kind is exceedingly complex and 
only the barest outline can be given here. In order to ensure uniform con- 
ditions for testing, every participating local education authority was asked to 
appoint a representative who was to be responsible for the local administration. 
These representatives were met at seventeen regional meetings, scattered over 
England and Wales, by officers of the Foundation, who briefed them on all 
details of the survey. In many cases the representatives, often psychologists, 
carried out the testing themselves. In other cases they in turn briefed the 
teachers in the schools concerned for this task. Full details of the test admin- 
istration were given in instruction booklets. The testing was completed by all 
schools within a two-week period. 

It was decided to test on one occasion only, and children who were absent 
for all tests were ignored. Each test was standardised to a mean of 100. and 
S.D.15 on the sample formed by all children who took it, although the joint 
analysis of the results was performed only on the sample of children in each age 
group who took all tests. The scores ot the 150 or so ‘ partial’ absentees were, 
in fact, examined and found to average some four points of standardised score 
below the mean; their omission from the main group, however, had a trivial 


effect on the mean. 
VI.—RESULTS. 


As has already been mentioned, there is little that has any real meaning 
to be reported at this stage on the question of backwardness. Ţ The pro- 
portions of children scoring below certain arbitrarily defined points can be 

solute value and only assume importance when 


calculated, but these have no ab é 
it can be shown that after a passage of time they have changed. However, 


two of the tests were given to more than one age group and the overlap in 
performance at different ages is of passing interest. i 

() Overlap in performance between age groups. The same reading test 
was given to children in all three age groups. As was indicated earlier, this test 
proved too difficult for the seven-year-olds to allow of any detailed analyses to 
be made with this age group, but the mean score obtained was found to be just 
over 7 points. The proportion of children in the 10+ age group scoring 7 or 
below was found to be 4-0 per cent., while the corresponding proportion in 
the 14+ age group was 1-4 per cent. It was also found that 17-0 per cent. 
children aged 14+ scored less than the 10+ mean, while 20-3 per cent. of the 
10-- age group scored more than the 14+ mean. 

In arithmetic, the only comparison of this kind that can be made is between 
the 10+ and 14+ age groups. On the mechanical arithmetic test given to both 
age groups 25-7 per cent. of the older children scored less than the 10+ mean, 
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while no fewer than 42-0 per cent. of the 10+ group scored above the 14-- mean. 
The higher proportions for this test can be accounted for by the fact that it was 
easier for both age groups ; the distributions being negatively skewed and the 
means relatively closer together. The test was, in fact, included in the survey 
primarily in order that a study could be made of the responses to individual 
items. It is intended to report on the results of the analysis of this test on a 
later occasion. 

(ii) Over-lap im performance between modern and grammar school pupils. 
One further comparison of interest is that between pupils in selective and 
unselective secondary schools. In the reading test at 14--, approximately 4 per 
cent. of the children in modern and all-age schools scored above the mean for 
grammar school children. This figure can be compared with that of 5 per cent. 
quoted in the Ministry's report on reading (Ministry of Education 1957). When 
this same test was given to the 10+ group, the children were, of course, still in 
primary schools. Subsequently, however, information was obtained on the 
type of secondary school to which they were transferred and the extent of the 
overlap at this age calculated. At 10+, also, approximately 4 per cent. of 
children who were subsequently transferred to modern and all-age schools 
scored above the mean for children who later went to grammar schools. In 
both cases, of course, the figure quoted is only one way ot reporting the lack of 
perfect correlation between the reading test and whatever procedure was used 
to allocate the children to the selective and unselective schools. It is interesting 
to note that the figure is unchanged between 10+- and 14+. 

Similar calculations were made for the mechanical arithmetic Test 1. 
Here, however, the extent of the overlap, that is, the percentage of modern and 
all-age children scoring above the grammar school mean, was 6 per cent. at 
10+ and 22 per cent. at 14--. The higher figures presumably again reflect the 
limited range of content of the test. 

(iii) Sex Differences. Table 3 gives the mean standardised scores for boys 
and girls separately on the tests given to all three age groups. The final column 
gives the difference of the means for the two sexes with standard errors. 


TABLE 3 


MEAN STANDARDISED SCORES FOR Boys AND GIRLS SEPARATELY ON TEsTS GIVEN TO ALL 
THREE AGE Groups. 


Age Group and Test Boys Boys—Girls 
with S.Es. 


Non-Verbal : 100-09 99-71 0-38 (0-67) 
Mechanical Arithmetic 99-14 100-91 — 1-77 (0-66) 


10+ 
Non-Verbal 99-97 101-02 — 1:05 (0-56) 
Reading : 99-71 100 — 0-68 (0-57) 
Mechanical Arithmetic I 98-73 102-0: — 3-30 (0-63) 
Problem Arithmetic II 100-00 100-81 — 0-81 (0-60) 


14- 

Non-Verbal 100-25 101-16 —0-91 (0-71) 
Reading — ++ sees eee 100-15 101-12 —0-97 (0-80) 
Mechanical Arithmetic I 99-97 100-29 —0-32 (0-63) 
Problem Arithmetic ILI 102-73 97-88 4-85 (0-76) 


E 
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As will be seen, the differences for the non-verbal tests at all three age 
groups, and those for the reading rests at 104- and 14+, do not achieve stat- 
istical significance. A brief word of comment can be made about the reading 
test. Although not significant, the higher mean scores achieved by girls on this 
test are consistent with the results obtained when this same test has been used 
in other researches. It will be recalled, however, that the Ministry of Educa- 
tion's Reading Surveys (Ministry of Education, 1957) which used a similar type 
of reading comprehension test, found a sex difference in favour of boys at both 
1l and 15. It is clear that the explanation of these sex differences lies in the 
content of the tests and not in any real superiority of one sex over the other. 


The most interesting results from Table 3 are those for the arithmetic 
tests. In mechanical arithmetic it will be seen that girls obtain a significantly 
higher mean score at 7--, that is at the beginning of the junior school course. 
At the end of the course the difference in favour of girls is even greater. But, 
after 3 years in secondary schools the difference has all but disappeared. In tests 
involving problem arithmetic, girls at 10+ obtain a higher mean score than 
boys, although the difference is not significant. By 14-- however, boys’ superi- 
ority is nearly 5 points of standardised score. These results indicate that there 
is distinct falling off in the performance of girls in this subject between the ages 
of 11 and 14. 

It is too often assumed, especially by secondary school teachers, that boys 
are inherently capable of better performance in mathematics than girls, although 
no valid evidence for this assumption exists. Certainly, the fact that at 14 boys 
are superior to girls cannot be used as evidence to support this contention, since 


at the age of 11 and even as young as 7, the opposite is the case. . 

From the item analyses carried out on the mechanical arithmetic tests, the 
average percentages of boys and girls in both the 10+ and 144- groups answering 
correctly the 25 items of the mechanical arithmetic Test 1 were calculated, and 


are given in Table 4. 


TABLE 4 


A PERCENTAGES OF BOYS AND 3iRLS IN DIFFERENT TYPES OF SECONDARY ScHOOL 

AVERAGE PERCENTAGES AND GI J; 

i Re 10+ AND 14+ ANSWERING ITEMS ON MECHANICAL ARITHMETIC LEST I CORRECTLY. 
/ A ANS 


Secondary Modern} Selective Schools All types 


and All-age 


10+ 144 


oy 
/o 

70-0 

68-2 


Difference: Boys—Girls 


ately for children attending 


f 
i ondary schools. The data at 10+ were, 0 
pe vpn still in primary ine but e 

1 the children subsequently 
f secondary schoo! wiped 


that the superiority of grammar sc 


This table presents these percentages separ 


selective and non-selec C 
course, obtained when the chil 
analysed according to the type O 
attended. It is interesting to note 
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is more or less maintained between 11 and 14, and it is in the Modern and All-age 
schools that the girls appear to have done relatively badly. 


Clearly, a further investigation into the teaching of this subject to girls in 
Modern schools would be interesting, or possibly for an explanation it is neces- 
sary to examine the way the subject is taught in primary schools (Dienes 1959) 


(iv) Urban-Rural Differences. With very few exceptions, large-scale 
surveys involving a comparison of children's test performance in urban and rural 
areas have demonstrated the superiority of urban children. (Terman and Merrill 
1937, Cross and Revell 1956, Bristol Institute of Education 1952, Morris 1959, 
Ministry of Education 1957). In general, the data from this survey support these 
previous findings. The mean standardised scores on the tests given to the 7+- 


and 10-+ age groups for urban and rural children are given in Table 5. 


TABLE 5 
MEAN STANDARDISED TEST SCORES FOR CHILDREN IN URBAN AND RURAL AREAS IN THE 
7+ AND 10+ AGE GROUPS. 


Urban-Rural 


Age Group and Test 
with S.Es. 
7+ 
Non-Verbal 97:12 
Mechanical Arithmetic 101-27 
10+ 
Non-Verbal ........«- 100-91 98-06 2-85 (1-54) 
Reading ...+--+> . 100-61 96-85 3-76 (1:55) 
Mechanical Arithmetic I 100-84 97-47 3-37 (1:45) 
Problem Arithmetic II 100-90 97-56 3-34 (1:44) 


The 7-+ mechanical arithmetic test results are an interesting exception to 


the general trend, but the difference is not statistically significant. In the 
attainment tests at 10-+, the rural children come a very poor second. 
at work in 


However, as Barr (1959) has pointed out, the main influence 3 
producing results such as these is not the nature of the rural or urban environ- 
ment itself, but rather “ the structure of the socio-economic group from which 


any school draws its pupils.” 


The explanation ot the differences in Table5 then, may well be that thereis a 
high positive correlation between test scores and home environment (Fraser, 
1959) and there are considerably fewer ‘ better ' homes in rural areas. 


(v) Designation of School. No stratification by designation of school was : 
made in the sampling procedure, but after the samples of schools were known, 
local authorities were asked to state whether it was a county or voluntary school. 
Using this information, the mean standardised scores for children attending the 
different schools were determined. Since the numbers in the different types of 
voluntary schools were fairly small, they have been grouped together for com- 
parison with County schools. In fact, only trivial differences were revealed 
between these types- The results are given in Table 6. 
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TABLE 6 


MEAN STANDARDISED TEST SCORES FOR CHILDREN ATTENDING COUNTY AND VOLUNTARY 
SCHOOLS IN THE 7-- AND 10+ AGE GROUPS. 


Age Group and Test County Schools |Voluntary Schools|County-Voluntary| 
with S.Es. 


Te 
Non-Verbal 101-08 
Mechanical Arithmetic 100-38 


10+ 
Non-Verbal 101-72 
Reading as 100-91 
Mechanical Arithmetic I 101-74 
Problem Arithmetic IT 101-87 


In order to account for the highly significant differences that were found, 
the hypothesis can be put forward that voluntary schools are more likely to be 
in poorer social areas. 

Unfortunately, no data on the socio-economic status of the catchment 
areas for the schools concerned were collected in the National Survey. 


(vi) School Type Differences. The results of the survey relevant to the 
differences in performance of 7+ and 10+ children attending junior with 
infants and junior without infants schools have been published elsewhere 
(Pidgeon 1959). They can conveniently be summarized here by saying that 
although the mean standardised scores in nearly all cases favoured the latter 
type of organisation, when further analyses were carried out in an attempt to 
eliminate the effects of the social grouping of the catchment areas for each 
school ME differences disappeared, or, in the case of reading at 10--, were 
reversed. 


With the 14+ age group differences in performance between children 
attending modern and grammar schools are, of course, expected, but of greater 

- interest are the differences between children from modern and all-age schools. 
The mean standardised test scores for children from different types of secondary 


school are given in Table 7. 


TABLE 7 


MEAN STANDARDISED SCORES FOR CHILDREN oF 14+ ATTENDING DIFFERENT TYPES OF 
SECONDARY SCHOOL. 


Type of School 


All-age Modern Grammar 


‘Non-verbal 94-82 (1-11) 97-33 (0- 115-04 (0-97) 
Reading 92-10 (0-61) | 9737 (l- 117-68 (0-81) 
Mechanical Arithmetic I... 97-76 (1-28) 97-54 E 109-79 (0-84) 
Problem Arithmetic IIL....| 93:96 (1-10) 96-20 (0. 117-57 (1:36) 
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Many theories can be advanced to explain the differences in results 
between all-age and modern schools. Again, one obvious one is the fact 
that modern schools, being a relatively newer type of school, are more likely 
to be found in better areas ; greater numbers of all-age schools are still to be 


found in poorer rural districts. Clearly, further evidence must be obtained if 
this hypothesis is to be tested. 


(vii) Sex of School. The last comparison to be made from these survey data is 
that of the differences in performance of children in the 14+- age group attending 
single-sex or mixed schools. In this age group the stratification in the sampling, 
apart from type of school, was on this variable. The mean standardised scores 
for children attending single sex or mixed schools, whether they were grammar, 
modern or any other type, are given in Table 8. As will be seen, the differences 
for all tests except the mechanical arithmetic are significant. 


TABLE 8 


MEAN STANDARDISED TEST SCORES FOR CHILDREN IN THE l4-- AGE GROUP ATTENDING 
SINGLE-SEX oR MIXED SCHOOLS. 


Test Single-Sex Mixed Schools Single-Sex— 
Schools Mixed with $ 


l Non-Verbal ; á 102-95 == j 98- | . (1-35) 
Reading 103-10 98-8 *59 (1-53) 


Mechanical Arithmetic I i 100-90 R +49 +22) 
Problem ArithmeticlII .... 101-95 98-6 3. (1:48) 


One possible explanation of the differences may be that where the choice is 
offered parents still prefer single-sex Schools, and since many long-established 
grammar schools are of this kind, in any 11+ allocation procedure, they fill 
up first with the brighter pupils. To test this, the performances of children in 
the two kinds of school were considered separately in the grammar and modern 
strata. The results, however, showed no clear pattern and certainly gave no 
indication that this was the correct or only explanation. It is also possible, of 


course, that an explanation of the differences reported in Table 8 lies in the fact 
that single sex schools tend to be in better areas. 


. . VII.—CoNcrusiow. 

The aim of this paper was to present some of the results obtained from 
administering tests to fully representative samples of children from the whole of 
England and Wales. For the most part, the results reported here have been 
confined to the comparisons of certain sub-groups, and in doing this far more 
problems have been raised than solved. The survey which provided the data 
was not designed to provide evidence to explain any differences found ; its 
intended purpose was merely to discover whether differences existed or not. 

In presenting the results the attempt has been made in some cases to offer 
certain explanations of discovered differences in performance between sub-groups 
of children, but it is not suggested that these explanations are the correct or 
only ones possible. In the case of children from urban and rural areas, it seems 
probable that the differences in performance found can, to a very large extent, 
if not completely, be accounted for by considering social and cultural factors 
associated with the home backgrounds of the children concerned. It may well 
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be true that similar factors affect many of the other differences reported in this 
paper, but certainly more exact evidence is required before a claim of this kind 
can be substantiated. In 1960, the National Foundation intends to repeat its 
survey and on that occasion it is hoped to obtain additional information which 
may help to answer the many questions still outstanding. 
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THE MEASUREMENT OF EFFICIENCY OF SELECTION 
BY PROCEDURES OF VARYING VALIDITIES 


By FRANK SANDON 
(Sometime Selection Officer, Birmingham Education Committee.) 


Summary. Tables are given to show the numbers and percentages ' correctly ' 
chosen at various intensities of selection by selection procedures of various 
validities. The tables are derived on the assumption of bivariate normal 
distribution, i.e., on the assumption that the measures of both the selection 
procedure and of the criterion are normally distributed, that the regressions 
are linear, and that the various arrays of each measure are homoscedastic. 


I.—INTRODUCTION. 

Ix Statistics in Psychology and Education, Garrett! gives a table (Table 23, 
quoting from Taylor and Russell), showing the proportions considered satis- 
factory in achievement in the criterion range from -05 to -95, where the correla- 
tions between test scores and criterion may range from -00 to 1:00. The table 
does not appear so convenient as alternatives that can be derived from the 
Tables for the Volumes of the Normal Bivariate Surface? prepared by co- 
workers of Karl Pearson. These latter tables give, for various values of 7 (the 
correlation coefficient), the volume of the sector of the bivariate normal surface 
cut off at different values of 7; and of 7, the two independent variables. From 
these, Tables 1 and 2 have been derived. 


TABLE 1 
NUMBERS ' CORRECTLY ' CHOSEN By TESTS CORRELATING / WITH CRITERION AND INTENDED 
TO SELECT THE ToP p PER CENT. FROM A POPULATION OF 10,000 


3500 4000 
3031 3512 
2835 3306 
2682 3147 
2113 2552 3010 


2005 2436 2889 
1906 2330 2778 
1815 2231 2671 
1729 2138 2576 
1647 2050 2483 


1569 1965 2393 
1494 1883 2332 
1421 1804 2223 
1381 1727 2141 
1283 1652 2061 


1216 1578 1982 
1151 1506 1905 
1087 1435 1829 
1024 1365 1718 

963 1295 1678 


900 1225 1600 
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TABLE 2 
PERCENTAGES ‘CORRECTLY’ CHOSEN OF ALL CHOSEN By TESTS CORRELATING r WITH 
CRITERION and INTENDED TO SELECT THE TOP p PER CENT. OF A POPULATION. 


II.—USE OF THE TABLES. 

For the educational psychologist, the chief function of the tables is to 
indicate how selection actually works in, for example, selecting the brighter 
pupils for the more exacting of various courses, when the proportion of the 
whole population to be chosen for such courses is specified and the correlation 
between the selecting procedure and the criterion is known. Thus, if we wish 
to select the best 15 per cent. of a complete age group of 10,000 children by 
tests for which the coefficient of correlation (over the whole age group) with 
later success is -80, then we shall select 1,500. From Table | we see that, of 
these, there are 913 who are ‘ correctly ’ chosen, and 587 who are ' incorrectly ' 
chosen in place of a different group of 587 who are ‘incorrectly ’ omitted. 
Similarly, Table 2 shows that 61 per cent. (more precisely 60-9 per cent). are 
‘correctly ' chosen. Table 3 summarizes the position. 

TABLE 3 


Selection Procedure 


By Criterion Selects 


587 


| Not Suitable d 
Suitable 918 


1500 
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On the whole, if such a result were one in an ‘ 11+ examination,’ then the 
contributory primary schools would, as pointed out in 1936,° be fairly satisfied. 
Out of every twenty of their pupils the sixteen more unsuitable ones are not 
chosen and two of the more suitable ones are. There are two other cases: 
here the examination seems to have chosen the one who is the poorer and omitted 
the better (of course, at the border line there is really little to choose between 
these latter two). But the receiving grammar schools would find that out of 
every five of their new pupils three only are really fitted for the course. 


III.—SOME CONSEQUENCES OF THE TABLES. 

These features of the tables often seem to cause surprise among those who 
are not fully familiar with the nature of the bivariate normal surface,* and to 
cause grave dissatisfaction with the selection procedure among statistically 
illiterate politicians and members of education committees. Actual experience, 
however, shows that we always have imperfect correlation between the selecting 
procedure and the criterion. This is true whatever the selecting procedure, 
whether, e.g., by tests, by interviews, by impressions, or by school records. 
If we classify and reclassify, term by term on school reports, as we do in the 
procedure that has been called ‘ progressive differentiation,’ we still have this 
imperfect correlation. Even if in this last case we could get a true classification 
at the beginning of the school term, individual differences in the rate of the 
attainment of intellectual or academic maturity would mean that at the end 
of the term the classification would no longer be true—there would be imperfect 
correlation between the two rankings. In consequence, in any practical case 
of selection, we shall inevitably choose some ‘ misfits’ and reject some of the 
‘ fit,’ and, in the majority of cases, these ‘ errors ’ can be disturbingly numerous. 

The question of ' sets' as a matter of school organisation illustrates a corol- 
lary of these conclusions. In certain cases, as for subjects that are more or less 
specific to one sex or type of course (such as P.E., handicraft, or Latin versus 
German), statistical considerations hardly enter. If there is a subject that is 
highly specific in its factors, i.e., one that correlates only slightly with all the 
other subjects of a school course, then, if it is measured term by term with fair 
reliability (as, perhaps, may be the case for music), ' setting ' may be justified. 
But, in other cases, such as those where ' setting ' has frequently been recom- 
mended by official opinion (e.g., for Mathematics, or French) where the abilities 
are not highly specific (i.e., where the correlations of the marks for the particular 
subject with the marks for all the other subjects are high) the possibility of 

misfits ' in classifying, by form or by ‘set,’ is determined by the term-by-term 
correlation of the marks of such subjects or group of subjects. Experience in 
grammar schools gives a figure of about -8 for these correlations (slightly higher, 
of course, for all the subjects combined, slightly less for a single one). Hence 
fora three-form entry school (e.g., of about 100 pupils per year), if we try to get 
a form or set of, say, thirty-five, there will be only twenty-five pupils 

correctly placed in the top or in the bottom form or ' set.’ With ten ‘ misfits ’ 
per form or ' set each term it hardly seems justifiable to have the two classifica- 
cur form work and for ‘ set ' for special subjects—complicating the time 
_ It should also be noticed that we can obtain a larger proporti ‘ft’ 
in any selection procedure only by reducing our ree: ET rude the ra 
consequence, by increasing the actual number of the ‘ unfit’? Th m : 
dissatisfied with getting only 61 per cent. ‘ fit’ in 1,500 child: us, if we 
pai of 10,000 when we use a test for which the Mili ero oq e 
cho ear * d "Ma 

ose 75 per cent. ' fit ' by looking for 4,000 of the 10,000. But our standard of 


an 
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* fitness ' has gone down and we are actually admitting 4,000—3.010, or 990, 
who are ‘ unfit’ even at this low level, in place of our former 1,500—913, or 


587, ‘ unfit ’ at the more stringent level. 


IV.—SOoME OTHER APPLICATIONS OF THE TABLES. 
. _ The tables, while appropriate for practical problems in education as 
indicated above, are also suitable for use in other kinds of problem. Thus 
suppose that we are choosing, for any purpose, at a level that will select 15 per 
cent. of all twin children. Then the twin pairs may be split by the selection 
procedure. If the selection procedure correlation between the two individuals 
of each pair of fraternal like-sex (‘ two-egg ') twins is -60,5 we have the state 
of affairs shown in Table 4. There are only 673 twins that are selected, together 


TABLE 4 


B 


Twins X 


Twins Y Not Selected Selected 


827 
673 


Not Selected 
Selected 


1500 


with their co-twins, out of every 1,500 twins selected. For the identical (‘ one- 
egg’) twins, the correlation is higher. If weassume that theselection procedure fre- 
quency distribution of all the various kinds of twins is the same (an assumption 
that is not exactly true when we select by an examination®), then Table 3 
can be read to show that out of every 1,500 identical twins that are selected, 
there will be 913 that are selected with their co-twins. In other words, the 
receiving schools will find that there are proportionately more identical twins 
on their roll than fraternal twins than would be found in the complete age 
group. This feature works at the other end as well. There will be a larger 
proportion of identical twins to be found in the E.S.N. schools than in the 
ordinary schools. A further feature would be if we consider, instead of selection 
by an examination, selection by death. This will split the fraternal twin pairs 
more than it will split the identical twin pairs, so that the number of identical 
twin pairs will gradually increase proportionately through life when compared 
with the number of fraternal twin pairs. 
The tables can similarly be used for other cases of selection. 
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THE VALIDITY OF TESTS OF SPATIAL ABILITY AS 
PREDICTORS OF SUCCESS ON TECHNICAL COURSES 


By I. MACFARLANE SMITH 
(King’s College, University of Durham.) ` 


Summary. The object of this investigation was to assess the validity of two 
tests of spatial ability against criteria of success on technical courses in a 
secondary technical school. 

Scores on the usual selection tests as well as on a test of spatial ability 
were correlated with marks obtained in internal and external examinations in a 
number of technical subjects taken (i) three years, and (ii) five years after the 
original selection examination. 


L—INTRODUCTION. 

THERE is already extensive evidence that scores on spatial tests administered 
during the secondary school stage are predictive of proficiency in technical 
occupations, e.g., Holliday® and Shuttleworth.? The two tests to be discussed 
in the present article, (i) Spatial Test I (N.F.E.R.) and (ii) Moray House Space 
Test I, were constructed in order to measure the spatial ability of pupils in the 
age ranges 11 : 0 to 13: 11 and 10:0 to 11 : 11, respectively. As both of these 
tests may be regarded as pioneering ‘ efforts,’ it may be of some interest to give 
a brief account of their history and origins. 

A full account of the first of these tests (Spatial Test I) has already been 
published in an article which included detailed information regarding its content 
and reliability together with some preliminary evidence of its validity." The 
test was originally constructed in its main essentials early in 1934!4 and was 
printed in 1950 on behalf of N.F.E.R. as Spatial Test I. It was fully standard- 
ised over the age range 11 : 0 to 13: 11 on three complete age-groups totalling 
some 8,000 pupils, (Norms are now available for slightly younger children.) 
The test has been used bya number of loca] authorities as well as by numerous 
research workers (e.g., 21, 23). In the preceding years, research on somewhat 
similar lines had been carried out by a number of other workers such as El 
Koussy’, Drew,* Peel,!? Bradford,? Dempster,? Emmett,? Watts and Slater,” 
and Watts.?! — 

Work on Moray House Space Test I was initiated in 1944 by the late 
Professor Godfrey Thomson, assisted by J. H. Gray and J. Y. Erskine, and was 
continued by W. G. Emmett in conjunction with J. T. Bain and L. F. Mills. 
A factor-analysis of test-scores was begun by Professor Godfrey Thomson, 
carried further by Swanson, and completed by W. G. Emmett, using D. N. 
Lawley’s Method of Maximum Likelihood. Emmett found four significant 
factors and tentatively interpreted the first three of these as g, v and k. 

Meantime, the writer had included Moray House Space Test I in the battery 
of tests which he administered to some 200 eleven-year-old boys in June and 
July, 1947. Like W. G. Emmett, he first applied Thurstone's Centroid Method 
to the resulting correlations and then D. N. Lawley’s Method of Maximum 
uA. Using Lawley's test of significance he found only two significant 
mme ut he wu ne the second factor as a bipolar factor, the positive values 
P. tbi: e E a scene and the negative values spatial loadings. According 
; othe = con a ion. fhe draft of Spatial Test I had a very substantial spatial 
foadins (c.f, 18) ES ue of Moray House Space I had relatively low spatial 

as Gedy € same interpretation is given to the second factor 
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extracted from the Moray House correlation matrix by Lawley's method,? the 
same conclusion would follow regarding M.H. Space Test 1, i.e., both sections 
would appear to have relatively low spatial loadings. If, however, following 
Emmett, the third factor is interpreted as the spatial factor, both sections of the 
test and particularly the three-dimensional section would appear to have very 
substantial spatial loadings. J 
In view of the conflicting interpretations which have been given to the 
results of these factor-analyses, it was thought that it would be of considerable 
interest to compare the predictive values of the two spatial tests, when assessed 
under approximately the same conditions. The data were made available 
through the kindness of Mr. T. H. C. Walker, now Headmaster of Tynemouth 
Technical School, and Mr. Stanley Hirst, Director of Education for Middles- 
brough. The criteria, however, do not relate to the same samples of pupils. 
The ‘ follow-up ' for Spatial Test I was carried out with a 13-year-old group 
after a period of three years, that for M.H. Space Test I was mainly with an 
11-year-old group after a period of five years. 
` So far as the writer is aware, these periods are longer than those for which 
results have been previously obtained. According to the manual for M.H. Space 
Test I, assessments for seventy-one grammar school boys in geometry, science 
and art by the head teacher at the end of the first year were found to have higher 
correlations with the scores in M.H. Space Test I than with the verbal intellig- 
ence, English and arithmetic scores in the admission examination. Also in a 
secondary modern school, the correlations of M.H. Space Test I with assess- 
ments of forty-four boys after one year's work in science, technical drawing, 
woodwork and metalwork were found to be very satisfactory. In the present 
enquiry, the results relating to M.H. Space Test I show a somewhat similar 


trend. 
JI.—TESTS USED IN THE SELECTION PROCEDURE IN MIDDLESBROUGH IN 1050 
AND 1951 
For the present investigation, a ‘follow-up’ was carried out of pupils admit- 
ted to the Technical School in Middlesbrough in 1950fand 1951. The sample of 
pupils studied consisted of three groups : 
(i) A group of sixty 13-year-old pupils who had been tested for re-alloca- 
tion at 13 in 1951. These had taken the following tests : 
M.H. Intelligence (Adv. 9). 
M.H. English 18. 
M.H. Arithmetic 18. , 
N.F.E.R. Spatial Test I (Macfarlane Smith). 
(ii) A group of ninety 13-year-old pupils who had been tested for re-alloca- 
tion at 13in 1950. These had taken the following tests : 
M.H. Intelligence 29. 
M.H. English 12. 
M.H. Arithmetic 12. 
M.H. Space Test I (Bain and colleagues). 
(ii) A group of sixty 11-year-old pupils who had been admitted to the 
"Technical School after taking the following tests in 1951 : 
M.H. Verbal Reasoning 46. 
M.H. English 22b. 


M.H. Arithmetic 22. — 
M.H. Space Test I (Bain and colleagues). 
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The third battery of tests had been given to the whole of the 11 plus age- 
group. Together with teachers’ estimates and school recommendation lists, 
this had formed the basis of the selection procedure for all the selective schools 
in Middlesbrough. Pupils had been selected in the first place either for the 
Technical School or for the Hugh Bell Selective School on the basis of scores in 
the verbal intelligence, English and arithmetic tests, together with teachers’ 
estimates and school recommendations. 

Differences between the spatial test score and the verbal intelligence test 
score, considered in conjunction with the attainment test scores, were regarded 
as giving an indication of the suitability of a boy for one or other of these two 
schools. It was considered that a good spatial test score associated with a good 
arithmetic score was likely to indicate a greater measure of suitability for the 
Technical School, while on the other hand, a relatively good verbal intelligence 
score with a good English score was thought to indicate greater suitability for 
the Hugh Bell School. 


III.—CORRELATIONS WITH EXAMINATION RESULTS. 


Correlations with subsequent examination results in various technical 
subjects are shown in Tables | and 3. Since the tests being evaluated had been 
taken into account in the selection procedure, neither of them can be expected 
to reveal its full predictive validity. Thus, the spatial tests would have shown 
higher correlations with the criteria if the low scoring pupils had been admitted 
to the Technical School, and similar considerations apply, of course, to the other 


TABLE 1 
VALIDITY OF SPATIAL T: — 
SraATiAL Test 1 (N.F.E.R.). 
Follow-up after three years of pupils w] a battery of selection tests in 
thev were 13 years of age. F vho took a battery of selection tests 
' Correlations are shown between scores in indivi TM criteri suci 
i nati cores in al tests and criteria of suc 
technical examinations taken three years iip individu al tes! 


1951, when 


cess in 


M.H. " -— 

ET Spatial |Intelligence| M.H. SERENO 

Subject Y "Test 1 Test English Anithmess 
(N.FE.R)| Adv. 9 | Test 18 | Test 

(Verbal) 


Engineering Drawing 
(Internal exam., 1954) E. 4-027 


Metalwork (Theory and 
practical. Internal 
exam.,1954)  ...... 


Woodwork (Theory and 
practical. Internal 
exam., 1954) 


Metalwork (G.C.E. J.M. 
Board, 1954) 


Woodwork (G.C.E. J.M. 
Board, 1954) 


Geometrical Drawing 
(G.C.E., J.M. Board, 
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tests used in the admission examination. Since results were available only for 
the selected sample of pupils, the correlations are considerably lower than they 
would have been, if data had been available for an unselected sample. Correc- 
tions have not been applied, however, to allow for the attenuation which has 
resulted from the reduction in the range of scores, and it cannot be assumed that 
all tests have been equally affected. Regression coefficients and the correspond- 
ing multiple correlations with the various criteria are given in Tables 2 and 4.* 

Certain tentative conclusions can be drawn from the results. The correla- 
tions of the spatial tests with the criteria are in nearly all cases higher, and in 
some cases substantially higher, than the correlations of the other tests used in 
the admission examination. It is, therefore, probabie tnat in selecting pupils 
for courses of technical education, the inclusion of a spatial test in the selection 
battery would add to its predictive value and would result in the selection of a 
more promising sample of pupils. A comparison of the correlations for the two 
spatial tests suggests that Spatial Test I has, on the whole, higher validities 
after three years than has M.H. Space Test I after five years, but this conclusion 
must be regarded as only tentative since the selection procedures at 114- and 
13-- may not have been identical. Regression coefficients for both spatial 
tests were usually substantial and positive, while more than half of the regression 
coefficients for the verbal intelligence tests were negative. 

TABLE 2 
VALIDITY OF SPATIAL TEST 1 (N.F.E.R.). 


Best weightings of test-scores in the selection battery for predicting subsequent s 
in the examinations. The multiple correlation is given in the last column and the regression 
coefficients in the preceding four columns. 


M.H. M.H. 
Spatial | Intell. M.H. Arith- | Correla- 
Subject Test 1 Test English metic lation 

FE Adv. 9 | Test 18 | Test 18 
! (Verbal) 


REGRESSON COEFFICIENTS 


Engincering Drawing (Intern- . 
alexam., 1954) OS — -036 


Metalwork (Theory and Prac- 
tical. Internal exam.,1954) 


Woodwork (Theory and Prac- 
tical. Internal exam.,1954) 


Metal 
1984) m2 2 ré 7:362 | 2.768 


+ +433 


Geometrical Drawing (G.C 


J-M. Board, 1954) 4--866 


Mean multiple correlation — + -637. 


* The correlations and multiple regression coefficients shown in Tables 1, 2, 3 and 4 
‘ere calculated on a Pegasus Computer using the Multiple Regression Analysis Programme 
Msi TN. prepared by S. A. Robinson and R. J. Taylor (British Iron and Steel Research 
Vssociation, List No. 101, 1959). 


142 Prediction of Success on Technical Courses 


TABLE 3 
Vatipity oF Moray House Space Test 1. 


(a) Follow-up after five years of pupils who took a battery of selection tests in 1951 
when they were 11 years of age. Correlations are shown between scores in individual tests 


and criteria of success in technical examinations taken five years later. 


M.H. M.H. 
Subject Intelligence} English 
Test 46 Test 22b 
(Verbal) 


Engineering Drawing 
(Internal exam., 1956) 


Engineering Drawing 
(G.C.E. Assoc. Exam. 
Board, 1956) .... 


Metalwork (G.C.E. Assoc. 
Exam. Board, 1956).. 


Woodwork (G.C.E. Assoc. 
Exam.Board, 1956) 


Metalwork (Theory and 
practical. Internal 
exam., 1956) 


(b) Follow-up after three years of pupils who took a battery of selection tests in 1950 
when they were 13 years of age. 


M.H. M.H. M.H. M.H. | 

Space Intelligence English Arithmetic 

Test 1 Test 46 Test 12 Test 12 
(Verbal) 


Subject 


Woodwork (Theory and 
practical. Internal 
exam., 1953) 


Woodwork (Practical 
only. Internal exam., 


zd 
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TABLE 4 
VaLrmprrY oF Moray House Space Test 1. 
Best weightings of test-scores in the selection battery for predicting subsequent 


suc in the examinations. The multiple correlation is given in the last column and the 


regression coefficients in the preceding four columns. 
(a) Pupils tested in 1951 at age 11. 


M.H. M.H. M.H. |Multiple 


. Space | Intell. | English | Arith- | Corre- 
Subject D s Test 46 | Test 22b| metic 
(Verbal) Test 22 


REGRESSON COEFFICIENTS 


Engineering Drawing (Intern- | 
al exam., 1956) * —-083 | —-634 | +-301 


Engineering Drawing (G.C.E. 
Assoc. exam. Board, 1956) 2 — «658 1-501 -186 


Metalwork (G.C.E. 
exam. Board, 1956) 2 : + -003 


Woodwork (G.C.E. ; 
exam. Board, 1956) 


Metalwork (Theory and prac- 
tical. Internal exam.,1956) 


(b) P 


M.H. M.H. M.H. M.H. |Multiple 
Space ll. | English | Arith- | Corre- 
Test 1 st 29 | Test 12 | metic lation 
Test 12 with 
Criterion| 


Subject 


REGRESSION COEFFICIENTS 


Woodwork (Theory and prac- 


tical. Internal exam., 1953) — +367 | —:426 -223 


Woodwork (Practical only. " 
Internal exam., 1953) ...- + 650 —:190 | —-374 +229 


Mean multiple correlation = -+ -499. 


In conclusion, some correlations with attainment in geometry are given 
for a class of 15-year-old grammar school pupils, though the data were not 
obtained in connection with the Middlesbrough investigation. The correla- 
tions, shown in Table 5, indicate the relationship between scores on Spatial 
Test I and the Manchester General Ability Test (Sen.) I ?' and scores on the 
Walton Geometry Attainment Test. The latter is an objective test consisting 
of sixty-si3 geometrical problems, in which the information given for cach prob- 


lem is supplied in the form of an annotated geometrical diagram. 
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The correlations in Table 5 suggest that Spatial Test I (N.F.E.R.) is a 
better indicator of ability to solve geometrical problems (riders of the conven- 
tional type) than is an advanced verbal intelligence test of the type of Manchester 
General Ability Test (Sen.) I. Thus, there are grounds for supposing that 
Spatial Test I might have value for predicting success in courses in mathematics 
in grammar schools. In a recent N.F.E.R. publication,” Yates and Pidgeon 
have supplied evidence showing that Spatial Test I can make a significant 
contribution towards predicting grammar school success. The best prediction 
of success in the grammar schools included in the investigation (some 800 pupils) 
was given by the combination of scores 4P.H.A.+2E+Sp. I where P.H.A. is 
Primary Head Teacher's Assessment, E is a special English Test (by A. F. 
Watts) and Sp. I is Spatial Test I (by the writer). This weighted battery gave 
the best correlation with the criterion, namely -931. It is possible that this 
result may indicate that spatial ability is important for success in mathematical 
and scientific subjects. 7 


TABLE 5 


CORRELATIONS OF SCORES FOR TWELVE Boys AND TWELVE GIRLS IN THE AGE-RANGE 
14:7 to 15:6. 
(Class IV B in a Grammar School.) 


Spatial Test Manchester Geometry 
1 (N.F.E.R.) | General Ability Attainment 
Test (Sen. 1) ‘Test (Walton) 


Spatial Test 1 (N.I.E 


—À - -569 6110 


Manchester General Ability Test 
| (Sem) li  asmxorepusewn tea 


| Geometry Attainment Test (Walton) 


IV.—CoxcLv 
A‘ follow-up ' over periods of three ani 
g oups o pupils who entered the Technical 
^ Correlations have been obtained suggesting that two spatial tests (N.F.E.R. 
oe: I Ew M.H. Space Test I) e a better indication than other selec- 
n t of subsequent performance in technical subjects such as engineering. 
awing, metalwork and woodwork. This was found to be true whether the 
criterion was based on internal examinations or on external G.C.E. examinations. 
: Regression coefficients and multiple correlations with the various criteria 
were also calculated. The regression coefficients for both spatial tests were a 
most cases substantial and positive, while more than half of the regression 
coefficients for the verbal intelligence tests were negative. j 
__ Evidence has also been obtained suggesting that Spatial Test I (N FER) 
is a better measure than a verbal intelligence test of abilities which enter into: 
attainment in geometry. This result was obtained with a fourth form o 
twenty-four pupils. It may partly account for the finding reported by Yates 
and Pidgeon that Spatial Test I, when used in conjunction with other tests, 
has been found to make a significant contribution towards the best prediction 
of all-round success in the grammar school. 


ars has been carried out on 
Middlesbrough in 1950: 


p. 


I. M. SMITH. I45 


ACKNOWLEDGEMENTS.—The writer is greatly indebted to Mr. T. | 
Walker, Headmaster, Tynemouth Technical School, who was responsible for 
supplying most of the data in the first place ; also to Mr. Stanley Hirst, Director 
of Education for Middlesbrough, who supplied information regarding the selec- 
tion procedure employed in Middlesbrough in 1951 and who readily answered 
various queries. 

Tribute must also be paid to the staff of Durham University Computing 
Laboratory and to one of the writer's former students, Mr. R. Y. Anderson, 
who assisted in various ways in connection with the calculation of correlation 


coefficients. 


V.—REFERENCES, 


1 Bary, J. T. (1946). The Construction of a Space Test. B.Ed. Thesis, Moray House» 
University of Edinburgh. 2 

* BRADFORD, E. J. G. (1948). Symposium on the selection of pupils for different types of 
secondary schools. IV—Àn experimenter's point of view. Brit. J. Educ. Psychol., 
18, 67-84. 

3 DEMPSTER, J. J. B. (1950). An Investigation into the Use of Verbal and Spatial Tests in 
the Selection of Boys for Grammar and Technical Schools. Ph.D. Thesis, University 
of London. 

* Drew, L. J. (1947). The measurement of technical ability. Occup. Psychol., 21, 34-48. 

è Emmett, W. G. (1949). Evidence of a space factor at 114- and earlier, ^ Brit. J. Psychol. 


(Stat. Sect.), 2, 3-16. , T 5 E rud 
* HorLIDay, F. (1940). The selection of apprentices for the engineering industry. Occup. 


hol., 14, 69. n " 
: ioci d A. H., El (1935). The visual perception of Space. Brit. J. Psychol., Monogr. 


Suppl., 20. un: a - ia 
* LAwrEY, D. N. (1943). The application of the maximum likelihood method to fa -tor 


analysis. Brit, J. Psychol., 33, 172-175. E 3 

* Mixts, L: F. (1947). The Properties of a Space Test. B.Ed. Thesis, Moray House, Univ- 
ersity of Edinburgh. " 

1 PEEL, E. A. (1949). Evidence of a practical factor at the ageofll. Brit. J. Educ. Psychol, 

, 1-15. 7 

a LM C. W. (1942). Tests of,technical aptitude. Occup. Psychol., 16, 175-182, 

1 SLATER, P. (1940). Some group tests of spatial judgment or practical ability. Occup. 
Psychol., 14, 40-55. T 

9? SLATER, P., and BENNETT, ELIZA) 
its relation to some educati 

1 SMITH, I. MACFARLANE ( 
of Glasgow, 1937. 

15 SurrH, I. MACFARLANE (194: 
Psychol., 22, 150-159. 

18 SurrB, T. MACFARLANE (1950). 


Newnes. 
1 SmrTH, I. MACFARLANE (1952). An I nvestigation into the Problem of Measuring Spatial 


Ability in Pupils of 11 to 12 and of Devising a Reliable Test for Selecting for Technical 
Education, together with a Study of the Personality Qualities of Pupils so Selected, 
Ph.D. Thesis, University of Manchester, 

Lo SIN, 1 MACEARUANE (1954). The development of a spatial test. Durham Res. Rev., 

28) lios R. D. (1947). Geometry Attainment Test and Manual, University of London 

Tess. 

2 Warts, A. F., and SLATER, P. (1950). The Allocation of Primary School Leavers to Courses 
of Secondary Education. Published for N.F.E.R. by Newnes. 

?1 Watts, A. F. (1953). A group performance test. Bulletin of N.F.E.R., 2, 15-21. 

22 WISEMAN, S., and COLLEAGUES (1960). The Manchester General Ability Test (Sen.) 1. 
University of London Press, ay: 

* YATES, A., and PrpGron, D. (1957). Admission to Grammar Schools. Published for 


N.F.E.R. by Newnes. 


(1943). The development of Spatial judgment and 
roblems. Occup. Psychol., 17, 139-155. 

e Form Perception Factor. Ed.B. Thesis, University 
yin Brit. J. Educ. Psychol., 3, 1938.) 

Measuring spatial ability in school pupils, Occup. 


Spatial Test 1 and Manual. Published for N.F.E.R. by 


(Manuscript received, August 19th, 1959.) 


SYMPOSIUM: CONTRIBUTIONS TO THE DIAGNOSIS AND 
AND REMEDIAL TREATMENT OF READING 
DIFFICULTIES 


I—TuE INVESTIGATION OF READING PROBLEMS Topay.* 


By M. D. VERNON 
(Department of Psychology, University of Reading.) 


Summary. In spite of the large number of studies in recent years of problems 
connected with the processes of reading and with inability to learn to read 
normally, these problems are still very imperfectly understood. Although 
they are undoubtedly complex, failure to solve them appears to be due in 
part to the inadequacy of the methods employed in investigating them. 
Experimental studies of methods of teaching reading are often not suitably 
controlled, nor carried out for a sufficient length of time. The various factors 
associated with different types of reading backwardness are not always properly 
measured, nor their incidence and frequency assessed in different individual 
cases. The causal relationship between these factors and reading backwardness 
remains obscure. In part, these failures may be due to the fact that insufficient 
attention has been devoted to the nature of the reading processes and of their 
breakdown in backward readers, 


I.—INTRODUCTION. 
Tue problems of learning to read, and of the failure to learn to read normally, 
continue to attract extensive study. Yet, within the last few years there 
appear to have been few investigations which have added greatly to our under- 
standing of these problems. We still know little as to the nature of the blockage 
which seems to prevent some children from learning to read normally, and 
particularly from acquiring the ability to recognize words. Nor do we know 
which are the processes which fail to develop. Some new evidence has been 
advanced as to the significant factors which characterize backward readers as 
contrasted with normal readers. But it is seldom clear to what extent these 
factors actually cause backwardness ; or whether they are merely associated 
with it, and have no causal relationship with it. 

Now it is inevitable that there should be considerable difficulty in establish- 
ing clearly principles as to the optimum methods of teaching reading, and the 
causes of failure, since the number of factors which affect learning to read is 
very large, and many of them are impossible to control experimentally. How- 
ever, far more might be discovered if the experimental methods were better 
adapted to the type of study. But, although statistical techniques for the 
treatment of results have been refined and elaborated in recent years, there has 
been little improvement in the techniques of investigation. It may well be that 
it is for this reason that so little new light has been thrown on these problems. 


IL—READING SURVEYS. 

No doubt, large-scale surveys of reading performance, such as that of 
Morris in Kent (1959), are of value to local education authorities in giving them 
some idea as to the reading achievement of children in their locality, and of the 
kind of conditions in which this is most and least satisfactory. In several recent 

. ," Based on a paper read to the South-Western Branch of the British Psychological 
Society in October, 1959. 
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surveys of this kind, the percentage of cases of severe i c 

has always appeared very small. Yet, in spite of dues do iere mec 
teachers continue to complain about the children who do not learn gem 
normally, or fail to learn at all ; and such cases are still frequently refered 6 

educational psychologists and to clinics. Indeed, the Morris survey showed tl t 
about 20 per cent. of the 7-year-old children were still at the ‘ first primeri st = 
or below, meaning that they were poor readers or non-readers. No doubt hum 
is a large amount of comparatively temporary backwardness, much of whi his 
relieved by suitable teaching ; and, in some cases,the children may grow d " 
it themselves. We never learn, however, the amount of severe Senter 
backwardness. How many children, apart from low-grade defectives “te 
learn to read at all? And how many learn just enough to read simple de in 
and instructions, the betting news and the football pools, but cannot read a 
continuous text? This information should not be impossible to obtain; but 
until we have it, we cannot tell whether illiteracy is still a serious problem in 
this country ; whether or not the current methods of teaching reading are about 
as effective as they can be; whether remedial teaching is overcomin och 
difficulties as normal school teaching cannot be expected to meet ; and heth T 
there is a considerable residue of adults and children who for some cause onteide 


the teaching situation never learn to read normally. 


IIT. TECHNIQUES FOR INVESTIGATING THE EFFECTS OF TEACHING METHODS. 
Not only do methods of investigation in current use fail to disclose the amount of 
permanent illiteracy in the population ; they also do not give us much reliable 
information as to the best methods of teaching reading and the probable causes 
of reading difficulty. Two main types of technique of investigation have been 
employed. Thc first, used for discovering the best methods of teaching reading, 
is the method of comparing the reading achievement of classes, either remedial 
or normal school classes, taught by different methods. In general, the results 

if the children are specially selected and 


are likely to be more satisfactory h 
matched for intelligence, socio-economic status, etc., and the teachers and 


methods also chosen for the purpose. If already existent ordinary classes are 
studied, the number of variables relating to the children, the school conditions 
etc., is so large as either to invalidate the results or make them impossibly 
complex to interpret. Thus, in Morris's survey, it appeared at first sight that 
children in schools where phonic teaching was introduced at a very early age, 
showed a better reading achievement in the primary school than children from 
schools in which phonics were introduced later. But this difference was signi- 
ficant only when allowance was made for the fact that the non-verbal intelligence 
of the children in the former schools was lower than that of the children in the 
latter. This somewhat surprising finding makes it difficult to decide whether or 

On the other hand, until the 


not the phonic methods were really superior. 1 
effects of intelligence were partialled out, it appeared that size of school, urban 


locality and socio-economic status all correlated positively with reading achieve- 
ment : but this result must be interpreted as showing only that more intelligent 
children, who have a higher average reading achievement, are found in better- 


class urban neighbourhoods where there are large schools. (It is only fair to 
note, incidentally, that Morris has since been carrying out intensive studies of 
various factors in classroo has teacher training, which, it appears, 


m teaching, suc: 
may be of considerably more impo! than the use of phonic or look-and-say 


à rtance t 
in enabling children to learn to read without difficulty.) 
It is perhaps worth comparing these findings 


with those of Kemp (1955) 
which gave rise to some comment a few vears ago. His method was to test the 
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achievement in reading and in other school subjects of a group of thirty 
11-year-old children from each of fifty mixed junior schools in the London area. 
There was no special design as to teachers and teaching methods. Here again, 
the relationships between children’s intelligence and socio-economic status, and 
the size of school and class, seemed to be so inextricably intermingled that it was 
difficult to say if any of these factors, apart from intelligence, had any effect 
on reading achievement. 


Much clearer results are obtained if some attempt is made to control these 
variables, and to measure the achievement of small groups of children specially 
selected, and with teaching methods as far as possible controlled. A good 
example of such an enquiry was that of Daniels and Diack (1956) into the 
efficacy of their modified phonic method of teaching backward 8-year-olds to 
read, as compared with the efficacy of a ‘ mixed’ method. But, even in this 
enquiry, which appeared to show an undoubted superiority for the phonic 
method, the competence of the teachers of the two types of class was not neces- 
sarily the same, thus introducing an important variable. Only by repeating 
such an experiment many times with teachers of varying competence in both 
types of class could this variable be compensated for. Moreover, some con- 
clusion might then be possible as to the ability of teachers with varying degrees 
of competence to utilize the various teaching methods. 


_Ace (1956) attempted to get over the difficulty of varying competence by 
having the same teacher give remedial reading teaching to two groups, one by 
means of the * Moxon ' method and the other by ‘ mixed’ methods. The former 
gave the better results. But even if this teacher was able to apply both methods 
with the same degree of competence and enthusiasm, it is still not justifiable to 
generalize from the findings that remedial teachers in general would obtain 
better results with the ‘Moxon’ method. 


The efficiency of the various methods is commonly assessed by comparing 

“the average test performances of the children taught by them. Valuable 

supplementary evidence could be obtained, however, by discovering how many 

failed to profit by them, and continued to show an appreciable degree of back- 

wardness. Indeed, it is probably more important to know whether a method 

is generally fairly successful (i.c., results in few cases of backwardness) than 
whether it produces a slightly higher average achievement. 


Another criticism which must be adv ainst experiments of this 
kind is that they are seldom carried out fee pude peno ; and the same 
is true with regard to the success of remedial teaching in general. We are, for 
RT HIPH. hatat the end of three months or six months, the children 

aught by one method have improved their reading to a greater extent on the 
er method. Now Sparks and Fay (1957) 
ght to read by means of a method called 
ie rapid progress at E ie a grove 
, Contained less phonics. By the end of the 
an ia however, there was no significant fference in achievement 
fülloweua in the wae But more positive results were obtained by means of a 
effect oft fresas well-known enquiry of Gardner (1950) which compared the 
school. The Pr dedu methods with that of formal ’ methods in the infant 
ad tà for th w-up, when the children were 9 or 10 years old, showed an 
vantage tor the "free expression ’ methods, which it was said produced no 
reading failures and a higher average reading achievement. But these findings 


were probably affected by differences in th i ít 
respective teachers. TESRE aaa 
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Again, we may be informed that the effect of a three or six-month period of 
remedial reading teaching has been to raise the children's average reading age 
bya year or more. But we are not justified in concluding that the same improve- 
ment is maintained indefinitely, and that the children eventually become normal 
readers. Thus, Collins (1956) compared the improvement in word recognition 
of three matched groups of backward readers : (1) attending a remedial educa- 
tion, centre; (2) given remedial teaching in school; (3) a control group. At the 
end of a six-month period, the first group showed a significantly greater improve- 
ment than the second and third groups, who were not significantly different 
from each other. But follow-up tests two or three years later showed no 
difference between the groups ; the first group had slowed down and the other 
two had caught them up. There is, indeed, evidence from other sources that 
the long-term effect of remedial reading teaching may be rather slight, unless 


the teaching is greatly prolonged. 

There are certain circumstances in which, however, a comparatively brief 
follow-up may give useful results. Roslow (1940) studied the effect of ‘ reading 
readiness’ activities in the kindergarten on the progress in reading of children 
when they entered Grade I of the ordinary school. The children were at this 
divided into three groups according to their mental age and their aptitude 
for reading as assessed by ' reading readiness ' tests ; and they were then taught 
along lines indicated by their performance on the tests. It was found that the 
majority of them made satisfactory progress during their year in Grade I ; and 
it could be assumed with some justification that they would continue to do so. 
Another comparatively short-term study was made by Mitzel and Medley (1957), 
of the relative degree of improvement shown by children taught by different 
teachers over a six-month period. It was found that the average differences 
between children in different classes in the same school were almost as great as 
those between children in different schools. This finding is valuable in that it 
shows the effects of different degrees of competence in the teachers over quite a` 
short period of time. However, it would also be interesting to know whether the 

behind through incompetent teaching caught up again if 


children who lagged 
vere subsequently taught by better teachers. 


point 


they v 


IV.—STUDIES OF GROUPS OF RETARDED READERS. 

We must now consider the reliability of the method most commonly used 

for investigating the causes of failure in learning to read—namely, the selection 
of a group of children retarded in reading and the comparison of their character- 
istics with those of a group of normal or superior readers. Numerous criticisms 
e adopted in many of these investigations, In the 


may be made of the procedur 
first place, the degree and nature of the backwardness may be very variable. 
Often the chilldren are said to be one year or a year-and-a-half below the normal 


reading test performance for their chronological age, their mental age or even 
their ‘grade.’ The last is a particularly unsatisfactory measure, since mental 
age within a grade may vary by five years or more. Even if reading performance 
is related to mental age, this mental age differs according as to whether intellig- 
ence has been tested by a group written test, an orally given verbal test or a 
non-verbal test. Obviously, the first of these is quite unsatisfactory for retarded 
readers, and, indeed, impossible for non-readers. But it has been argued that 
the non-verbal tests are unsatisfactory also in that they give no measure of the 
child’s general ability to use Janguage. Probably the most satisfactory assess- 
ment is given by the WISC, which measures both oral verbal and non-verbal 
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intelligence, and thus affords a measure of retardation below both the verbal 
and the non-verbal M.A. But individual intelligence tests are rejected by 
many experimenters as being too time-consuming. 

However, the criterion of retardation as a performance one year or a year- 
and-a-half below the standard reading test performance is not in any case a good 
one. It represents an attempt to quantify on a single scale performances which 
may, in fact, be qualitatively different from one another : that of the child who 
cannot read at all, or can merely recognize a few words he has learnt by heart ; 
and of the child who knows how to read new words, but perhaps reads only one 
word at a time, and fails to understand their meaning. The latter type of back- 
wardness, tiresome as it may be in school, should be considered as falling into a 
different category from the former. It is possible that the underlying causes 
of these conditions are different ; but this we cannot tell until a clear distinction 
is made between their incidence. 

When experimenters endeavour to judge the effect of various causal factors 
in producing reading backwardness, they may find some difficulty in deciding 
on the best methods of assessing these factors and their association with the 
reading backwardness. The usual practice with discontinuous functions, such 
as laterality, is to compare the frequency of occurrence of these in comparable 
normal and backward readers by means of a contingency coefficient. But the 
urge is strong to convert such measures into indices which can be submitted to 
more elaborate parametric statistics. Thus, even with laterality some experi- 

R-L* 
RHL” RFL’ 
measure degree of laterality. With other less obviously discontinuous functions, 
the tendency is stronger. There is no a priori reason why actual test scores, say 
of perceptual span, should not be correlated with scores on reading tests; or 
converted into averages, and the significance of the difference of these calculated 
as between backward and normal readers. But there is, nevertheless, the 
considerable disadvantage in these procedures that we cannot tell whether a// 
the backward readers are to some extent deficient in these functions; or 
whether some are highly deficient and others normal, which is much more 
probable. Therefore, even in these cases, it may be better to determine the 
frequencies of low scores, and hence demonstrate how characteristic this defect 
is of backward readers. 

_ The calculation of averages or correlations of grades on various qualities 
which cannot be scored numerically appears not only to suffer from the dis- 
advantage mentioned, but also to be statistically suspect. It might be better 
to submit these to some form of non-parametric statistics, since they are neither 
continuous nor linear functions. This point may be illustrated by reference to 
the recently published monograph by Malmquist (1958), which describes a very 
detailed and far-reaching study of the reading performance of 400 first-grade 
Swedish children. These children were divided on the basis of reading test 
scores into three groups: poor readers, who scored 1e or more below the norm ; 
good readers, who scored lo or more above the norm ; and medium readers, 
from +-1 to —le. Incidentally, very few of these children appear to have been 
non-readers ; perhaps the Swedish language is comparatively easy to learn to 
read, in the early stages, and there ave very few non-readers. Malmquist 
determined the frequency in the three groups of a number of factors such as : 
social group, parents’ education and other social indices (significant association 


menters have concocted an index, r which is supposed to 


* R and L being the number of activities in which the right or left hand, foot or eve is 
preferred, respectively. 
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with poor reading); prematurity of birth (not significant) ; speech defects 
(significant) ; visual defects (not significant) ; laterality (not significant) ; 
pre-school diseases (not significant). She also used this method for certain 
personality factors, such as nervous symptoms reported by parents during the 
pre-school period, and found the latter to be more frequent for the good than the 
poor readers. But for personality qualities shown in school, such as concentra- 
tion, persistence, self-confidence, dominance-submissiveness, stability-nervous- 
ness, she obtained ratings from the teachers on a five-point scale, correlated 
these ratings with reading scores, and also calculated the significance of the 
differences between the average ratings for the three groups. These were said 
to be significantly different, and the correlations were of the order of 0-3—0-4. 
These methods are some of the most frequently used in reading studies ; 
but what do they prove to us? There is somewhat less of these desirable 
qualities among the poorer than among the better readers. But to what extent 


is each of these qualities defective in how many backward readers? Further- 
istence and emotional stability the cause 


more, were lack of concentration, persi 

or the effect of difficulty in learning to read ? Among older children particularly, 

many of the symptoms of emotional maladjustment may be caused by their 

failure in reading. In first-grade children it should be possible to distinguish 

between cause and effect. Yet apparently among Malmquist's children, the 

poor readers had fewer nervous symptoms before coming to school, but were 
in school, once they had begun learning to read. If the evidence of 


less stable in sc 7 
the teachers’ ratings can be accepted, it would seem that reading backwardness 


had produced this loss of stability. 

‘Another instance of the difficulty of distinguishing cause and effect is 
given by the frequently quoted association between visual perceptual ability 
and reading achievement. Both Malmquist (1958) and Goins (1958) found that 
among first-grade children good readers performed better than poor readers on 
tests of perception. But the superiority seems to have been most apparent in 
the perception of digits and letters, in Malmquist’s experiments ; and in any 
case, was largely a function of superiority in intelligence. In Goins’ experiments 
the superiority was in pattern copying and absence of reversals in perception. 
But may not these superior performances have been due, at least in part, to the 
reading (and writing) achievement of the better readers, which had taught them 
the necessity of distinguishing small details accurately ? For when Goins went 
on to determine the effect on reading of practice in the tachistoscopic perception 
of figures, digits, letters and words, she found that, although tachistoscopic 
perception improved considerably, the effect on reading was inconsiderable. 

In another recent survey of the possible causes of reading backwardness, 
there has been a real attempt to determine not only the association between 
certain factors and backwardness, but also which of these factors may be causal. 
Stott (1959) examined the records of illness in early childhood, the personality 


characteristics and the reading backwardness of an unselected sample of 142 


children who had spent at least 2 weeks in hospital during infancy. He also 
compared them with a control group of children of the same age and sex who 
were not known to have been in hospital. The hospitalized children were more 
backward than the controls, and their backwardness was also associated with 
the number and type of diseases from which they had suffered, and with a 
personality characteristic termed ‘ unforthcomingness,’ which seems to denote 
an impairment of motivation and inability to attempt actively to overcome 
difficulties. In an earlier study (1957), Stott had established an association 
between maternal stress during pregnancy, frequent early illness and low intellig- 
ence; and between backwardness in school and ‘ unforthcomingness '. The 
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general conclusion from the present survey was that stress during pregnancy 
might set up a general syndrome, which could be termed ' weak constitution, 
giving rise to frequent illness, mental retardation and unforthcomingness ; 
but that it was also possible that the retardation might be enhanced, if not 
caused directly, by defective motivation, which, in turn, might have resulted 
from the early illness. The factors of social class difference, and of separation 


from the mother in infancy by hospitalization, were found not to be of import- 
ance. 


Clearly much further study is required to define the exact effects of these 
various factors, though undoubtedly the greatest impairment may be produced 
by a multiplicity of causes. It is often difficult to follow Stott's reasoning as 
to which was cause and which effect. Nor did he take into account the existence 
of ‘ maternal over-protection ' which may very well affect children who have 
suffered much illness in early childhood, and which almost certainly tends to 
aggravate the personality quality he terms ' unforthcomingness.’ Again, he 
appears to equate poor reading achievement, general educational backwardness 
and low intelligence. Though some reading backwardness may be associated 
with low intelligence, we know that some of the most severe cases are of normal 
intelligence. But Stott's measurement of reading achievement was too rough to 


pick out such cases. Thus, it may well be that they are quite distinct from the 
cases of general subnormality studied by Stott. 


There are also cases in which it appears that there is unlikely to be any direct 
association between a certain factor and reading achievement; but that the 
association is with some other factor or factors not measured by the experi- 
menter. This seems to have been the case in the study carried out by Lynn 
(1955), which appeared to demonstrate that anxiety might improve reading 
rather than impairing it. Eighty junior school children, aged 71-11 years, 
selected to include good, average and poor readers, were asked questions drawn 
from the Taylor Anxiety Scale about personal anxieties such as fear of the 
kd grann, dislike of being alone, etc., an 
anxieties such as fear of animals, thunder, ni . Signi rrela- 
tions were obtained of 0-33 between re ee 
and of 0-24 with impersonal anxietie 


that anxious children ‘ withdraw ’ into 


ious to succeed in the 114+ examination, It 
would, however, be extremely unwise to d ildren i 
general are good readers, or that thei Senet era Ks A 
It seems far more probable that 
correlated with some other quality, 


V.—Inpivipvar STUDIES. 
It seems possible that it is on] 
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study by Spache (1957) using the Rosenweig Picture Frustration test showed 
that the responses of backward readers (of various ages) could be grouped into 


those of : 
(1) The overtly hostile, aggressive and intolerant. 
2) Those below normal in hostility, excessively apt to accept blame. 


(2 

(3) The defensive, hyper-sensitive to criticism. 

(4) Those who tended to withdraw in the face of difficulties. 

(5) Those who would not accept failure, but drove themselves on in spite 

of all obstacles. 
These observations, in so far as conclusions based on a single projection test 
can be accepted as valid, demonstrate patterns of personality which could be 
looked for in individual studies also ; and if they appeared, could be related to 
environmental factors such as parental treatment. They may, of course, have 
been caused by reading failure in some cases, rather than having caused it; but 
adequate case histories and early testing might throw some light on this. 

One method of clasifying the data from individual cases is to record in 
tables the presence or absence of relevant factors in each case. It is interesting 
to note that Malmquist (1958) adopted this procedure with thirty-four of her 
children of normal or above normal I.Q. whose reading performance was 
markedly inferior to that expected for their M.A. But she made no further use 
of these data. From a casual inspection, these cases appear to have come mainly 
from families of low socio-economic status, the parents having themselves had 
little education ; they suffered from few speech or other defects, but in many 
cases had had severe or prolonged illnesses before coming to school ; on the 
whole they possessed normal sociability, but many were lacking in persistence, 
concentration and emotional stability. This suggests a picture somewhat 

d and physically sub-normal 


resembling Stott's of the impoverished, unculturei i i 
child who often finds difficulty in starting learning and tackling difficult tasks 


when he comes to school; but who, in many cases, has little prolonged or 
severe reading disability. 'As I mentioned above, there seem to have been few 
of the latter cases in Malmquist's sample. A tabulation of severe cases—non- 
readers, for instance—would probably show different patterns of symptoms and 
causes. It would, of course, require a much more detailed diagnosis of person- 
ality qualities and characteristics of the home background than that obtained 


by Malmquist. 
Little evidence has been brough 


or nature of the disorder sometimes ca 


t forward in recent years as to the incidence 
lled ‘ specific dyslexia,’ characterized in 
some cases, though not in all, by language deficiencies, motor inco-ordination, 
incomplete laterality, hyperactivity and some emotional instability. A study 
by De Hirsch (1954) of children of this type showed that they may also be 
deficient in figure-ground differentiation and in the ability to form abstract 
concepts—though to a less extent than are cerebral-palsied children. This 
suggests some form of neurological deficiency in the ‘ specific dyslexic.’ But 
positive evidence corroborating this hypothesis is lacking, and is, of course. 


very difficult to obtain. 


VI.— CONCLUSIONS. 


It may well be that no definite evidence as to the causes of reading disability 
will be forthcoming until more is known as to the fundamental nature of the 
disability—exactly what it is that these children cannot do. At present, it 
appears that there must exist some failure in reasoning related to the use of 
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language which precludes these children from analysing the printed words, 
systematically associating sounds to the constituent letters and synthesizing 
these to form the total word sound. But much more detailed experimental 
study of the processes of reasoning in backward readers is necessary to elucidate 
the exact nature of the failure. 
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II—THE EFFECT OF PRACTICE ON PERFORMANCE IN SCHOLASTIC 
TESTS. 


By W. CURR anp N. GOURLAY 
(Education Department, University of Birmingham.) 


Summary. 1.—The main object of the investigation was to obtain some 
measure of practice effect for tests in mechanical reading, reading comprehen- 
sion and arithmetic. Standardised tests in these skills were given at varying 
intervals of time to groups of 9-10-year-old children in four primary schools. 

2,—The statistical analysis employed provided estimates of the mean 
gains both at the average level of ability for the age group (9-5 years) and two 
years below average (7:5 years). 

Practice effect was greatest for reading comprehension. At the 9-5 level, 
a re-test after one month produced a mean gain of 10-1 months and further 
re-tests after 3 and 6 months gave gains of 18:2and 26-9 months, respectively. 
The corresponding figures at the 7-5 level were 8:9, 10:4 and 12-2 months. 

For the mechanical reading test, the gains were 5:3, 8-0 and 14-1 months 
at the 9-5 level and 4-4, 6-2 and 11:8 at the 7-5 level. Practice effect was only 
slight for the arithmetic test. 

3.—Practice effect was also estimated for other cases. In particular, 
pupils tested on both forms of the reading comprehension test at an interval of 
one month registered gains of 7-2 and 4-2 months at the 9-5 and 7:5 levels, 
respectively. 

4.—Reference was made to our earlier investigation into the scholastic 
results of remedial education. It was pointed out that, in the light of the data 
here presented, the gains made by the controls in our experiment could be 
explained entirely in terms of maturation and practice effect. Data are also 
given to show the impermanence of the net gains made by the pupils receiving 
remedial education. Corroborative data from another investigation are also 


presented. 


I.—INTRODUCTION. 

Tus investigation is a sequel to an experiment in remedial education reported 
a few years ago in this Journal) Earlier attempts at assessing the scholastic 
results of remedial education had used a simple test-retest procedure. We 
maintained that gross gains, as measured by tests, were inflated by a number of 
factors, in particular (a) practice effect ; (b) regression effect ; and (c) a coaching 
effect ; and that these factors must be allowed for before a valid assessment of 
the efficacy of remedial education can be obtained. 

In our experiment, we allowed for the first two factors by introducing a 
control group of pupils who received no special instruction but who were 
subjected to exactly the same testing programme as the remedial group. The 
' true ' gain of the remedial group was then defined as the amount by which the 
gross gain of the remedial group exceeded the gross gain of the controls. (We 
attempted to demonstrate the coaching effect by showing that the ' true ' gain 
for reading varied with the type of reading test used). 

From the standpoint of the remedial educationist, the results of our experi- 
ment were disappointing. The gross gains of the remedial group were comparable 
with the gains reported for other investigations but, since appreciable gains 
were also made by the controls, the ' true’ gains were small. It was only in 
reading, as measured by a Schonell Reading Comprehension test, that a stat- 


istically significant ‘ true ' gain was obtained. 
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Obviously, the results of our experiment would have been more favourable 
to remedial education if the gross gains of the controls had not been so large. 
The suggestion was, therefore, put forward by Birch? and supported later by 
Pringle and Gulliford? that either the class teachers had paid special attention 
to the control pupils or they had been so influenced by the experiment that they 
were able to bring about substantial over-all improvement in their class work. 
We, of course, favoured the simple explanation that the gains of the controls 
had been inflated mainly owing to the effect of practice in the tests used but, at 
the time, we had no practice effect data to confirm this. 

The present investigation was, therefore, carried out in order to obtain some 
measure of the amount of practice effect which can occur when the common types 
of scholastic test are given at varying intervals of time. Also, in view of the 
likelihood of differential practice effects, it was intended that the investigation 
should provide a measure of practice effect not only at the average level, but at 
the level of the pupils to be found in remedial groups. 

‘A section on coaching appears at the end of the paper in which, as a result 
of further data, we broaden the conception of coaching given in our earlier 
study. The section has been added in order to indicate more fully our present 
position in regard to remedial education. 


IL—PLAN OF INVESTIGATION. 
(i) Selection of subjects.— The entire testing programme was carried out in 


four Birmingham primary schools by four members of the Diploma course in 
the Psychology of Childhood, each operating in the one school throughout the 
investigation.* In our remedial education experiment, we had selected children 
between 8 and 9 years of age, but as many of the children in this group scored 
zero in one or two of the tests, particularly the Reading Comprehension test, 
it was decided that for the present investigation, children between 9 and 10 
years only would be included. A further reason for this change was that most 
other investigations in this field tend to deal with older children, on the grounds 
that they are more likely to present true cases of underfunctioning, as distinct 
from mere discrepanices in attainment arising from inequalities in rate of 
development. 

In each of the four schools, a list was drawn up of the names of all the 
children between 9 and 10 years of age. From each list, four groups—each 
consisting of six boys and six girls—and two groups—each consisting of twelve 
boys and twelve girls—were chosen at random. (We used random numbers for 
this purpose). 

In the account which follows the four schools are denoted by the letters 
P,Q, R,S, and the groups by the letters A, B, C, D, and E, F, respectively. 

(ii) Tests. The following Schonell tests were used : 

(a) Reading Comprehension test, Forms A and B (RCA and RCB). 

(b) Graded Word Reading test (GW). 

(c) Mechanical Arithmetic test, Form A (MA). 


For convenience, we shall sometimes refer to these tests by means of the 
abbreviations In the brackets. 


* Some doubt has been expressed concerning the reliability of test scores when the 
ve administered by Diploma students. An investigation, which we carried out 
4 shows that even in the case of administering the more difficult Terman-Merrill 
oma students are little short of the standard of fully qualified psychologists. 


tests a 
recently, 
test, Dipl 


ra 
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In the beginning a spelling test was also included but had to be dropped 
because of the limited amount of time available for testing. However, a small 
amount of data was obtained for the spelling test which suggested that, as with 
the arithmetic test, practice effect is not very great. (The data are not repro- 


duced here). 
- (iii) The testing programme. The testing programme in each school was as 
OHOWS : 


Groups tested 


Dec. June. 


It will be seen that the A-groups were tested in November and re-tested on 
each subsequent occasion of testing, i.e., at intervals of 1 month, 3 months and 
3 months; the B-groups in December and June only ; and so on. 


III.—RESULTS. 


(i) Mean gains (Groups A, B, C and D). Tables 1, 2 and 3 show the initial 
mean scores and the subsequent gains (as measured from initial score) for the 
A, B, Cand D groups in each school. (There was no evidence of sex differences. 
The results for boys and girls were, therefore, combined). 


TABLE 1 


IN MONTHS) ACHIEVED BY A, B, C and D-GROUPS IN 


SCORES AND GAINS ( £ E A 
COMPREHENSION TEST, Form D. 


ScuoxELL's READING 


INITIAL MEAN $ 


Group ^ Group B | Group € | Group} 
| D 
School Initial Gain Initial | Gain || Initial) Gain || Score 
score |— —————4| score | Score | 
(Nov.)| Dec. | Mar. | Jun. | (Dec) (Jun) || (Mar) (JunJ]| Jun.) 
on il da 
P 101-9 9-6 | 11-6 | 17-0 ||110-2 5-1 |100-6 9:5 | 96-2 
Q 106-1 | 11-9 | 19-0 108-0 107-0 | 10-0 | 31 
R 108-7 | 8-5 | 16-4 114-0 | 16-0 [112-5 123-6 
— an a 
H S 91- 3- 9: E | , , 5 
2 7 » 7 sil 9-9 || 88:3 7-5 ||102-0 5-8 || 94-0 
Total ..-- 102-1 8-4 | 14-0 | 18:8 |105:1 9-3 |105-4 9.9 LZ 2 
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ra 
IS 
[os] 


TABLE 2 


INITIAL MEAN SCORES AND GAINS (IN MONTHS) ACHIEVED BY A, B, C AND D-GROUPS IN 
SCHONELL’S GRADED WorD READING TEST. 


Group B Group C 


School Initial Initial] Gain || Initial 
score score 
Nov.) (Dec.) 


113-2 E 118-4 


116-0 : 119-8 


113-2 ‘ ; .5 |120-2 


84-4 É 75-5 


106-7 i 108-4 


TABLE 3 


INITIAL MEAN SCORES AND GAINS (IN MONTHS) ACHIEVED py A, B, C anp D-GROUPS IN 
SCHONELL'S MECHANICAL ARITHMETIC TEST Form A. 


Group A Group B Group C 


School Sai Initial] Gain || Initial Score 
score score 


(Dec.)| (Jun.) (Mar.) (Jun.) 


102-0 5.6 |101-8 97:7 


107-2 14:7 |115:0 


118:2 6-2 |121:6 


91:6 4-6 ||107:4 


104-8 7:8 ||111-5 


Several points can be made : 
(a) The schools differ significantly in regard to their initial mean scores 
(as was shown by an analysis of variance). 
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(b) The differences between schools in regard to mean gains are also 
significant.* Nor do these differences disappear when allowance is made for 
the differences in level of the schools (as was shown by an analysis of co- 
variance). 

c) Standard errors can be calculated for each of the gains shown in 
Tables 1, 2 and 3. But, in assigning a standard error to any one gain, it is 
necessary to know what variation is being taken into account. 

The gain for any one pupil can be analysed into (i) a school component (the 
mean gain of the school to which the individual belongs); (ii) an individual 
component (which may be positive or negative) ; and (iii) a random component 


(to explain fluctuations in the gain of the individual from testing to testing). 


If group mean gains are to be taken as estimates of the mean gains for the 
schools (with respect to which the groups are only samples), standard errors must 
allow for variation due to (ii) and (iii). For the three mean gains of group A, 


standard errors were found to be of the order 2, 2:5 and 3 months, respectively 
(for all three tests) and for the mean gains of groups B and C of the order 2-5 


and 3 months, respectively. 


We come now to the question of the standard errors of the mean gains 
derived from the four schools taken together. As estimates of the mean gains 
for the four schools only, their standard errors are about half the size of the 
standard errors already given. But as estimates of the mean gains for the total 
population of schools, of which our four schools are only a sample, their standard 
errors are very much larger since allowance must now be made for variation due 
to (i) as well as (ii) and (ii). However, our figures at least give an indication of 
the range of gains to be expected from schoolgroups. To have attempted more 
would have meant testing children in a large number of schools, a task which 
was beyond our resources. 

matter of interpreting the gains. An examination 
of the magnitude of the gains, particularly the gains made by the A-groups at 


their second testing (one month after the first), shows that they cannot be 


attributed entirely to maturation. (Here we are using the term to denote the 
genuine improvement in performance which we would expect the children to 
iod of testing). Obviously, practice effect is present. 


make over the pert 
If we assume a normal rate of maturation—one month of development per 
month of time—it will be seen that the effect of practice is about twice as great 


for the Reading Comprehension test as for the Graded Word Reading test. 
In the case of the Arithmetic test, practice effect would appear to be slight. 

It was intended that the design of our investigation would provide us with 
a measure of the mean rate of maturation of the four schools. For the overall 
mean scores obtained by the A, B, C and D-groups, when they were first tested, 
can be taken as estimates of the mean maturation level of the schools for 
November, December, March, and June, respectively. These mean scores are 


reproduced in Table 4. 


(d) There remains the 


* [n his criticism of our earlier investigation, Birch regarded the significance of the 
schools variance as evidence that remedial education was being effective in some schools if 
not in others. But this could only have followed from a significant interaction variance 
{treatments x schools). 


+ Other factors must also have operated (e.g., variations in conditions at the time of 
testing) but as they are random we shall ignore them in our discussion of the interpretation 
i It is to be noted that the standard errors given in the previous section are 


of the gains. +t X e p 
underestimates In that they do not take account of this source of variation. 
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TABLE 4 


OVERALL MEAN SCORES OBTAINED By A, B, C and D Groups WHEN TESTED FOR THE First 
: TIME. 


Mean Scores 


A-group B-group C-group D-group 
(tested Nov.) | (tested Dec.) | (tested Mar.) | (tested Jun.) 


102-1 5- 107-2 
106-7 


103-8 


gains made by the A-groups in the two reading 
Comparing the final scores of the A-groups 
with the scores of the D-groups (also tested in 
; nd the 5 per cent. level of significance. In the 

© mean score of the D-groups exceeds the final 
i > is, of course, not statistically 


Groups A, B, C and D). It is generally 
t more from practice in a test than dull 
i 1 d strated in the case of intelligence tests by 
Peel? and by Wiseman and Wrigley 5 Actually, Peel showed that practice effect 
slightly more than one standard deviation above 
; ut this was not confirmed by Wiseman and 
Wrigley who found a steady increase in Practice effect with level of ability. 

ffect is almost certain to operate in the case of 
simple average of the mean practice effects for 
alue. It is not the best estimate which we can 


Be fa data for each school had again to be 
: i any overall averages could be taken. The gains at the 
noe ley el o3 years) and two years below average (7-5 years) wee derived 

Wo reading tests and are shown in Tables 5 and6. (Since practice effect 


was small for the arithmetic test ana i 
s st, analysis of the i ^ 
considered to be worth the labour involved). Se RERIN AS EEE 
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TABLE 5 


16r 


DIFFERENTIAL GAINS (IN MONTHS) FOR READING COMPREHENSION TEST. 


Group A Group C 
School Reading B Gain : Gain Gain 
Nov.- Dec.| Nov.-Mar.| Nov.—Jun. || Dec.-Jun. || Mar.—Jun. 
= 9-5 yrs. 8-9 13-0 23-6 5-9 66 
a 7:5 yrs. 12-6 10-6 13-2 5-1 10-6 
zx 9-5 yrs. 13-0 22-0 26-9 76 12-9 
Q 7-5 yrs. 9:7 12-9 11-1 10-7 7:0 
29-2 15-4 18:1 


9-5 yrs. 


8-5 18-1 


7:5 yrs. 8:5 8-6 


9-5 yrs. 


19-6 


n 


7:5 yrs. 46 94 


13:3 


—1-9 


DIFFERENTIAL 


School 


9-5 yrs. 
(| 7-5 yrs. 


TABLE 6 


GAINS (IN MONTHS) FoR GRADED WORD READING TEST. 


Group A 


Group C 


Gain 


Nov.-Dec. | Nov.-Mar. 


| Nov.- Jun. 


Gain 


Mar.—Jun. 


0:5 85 


12-7 


56 


2-0 


11-9 


9.0 


18-8 


7:3 


15.8 


48 


9.6 


4-7 


74 


6.8 


15-4 


3:6 


11:9 


5-3 


14-1 


44 


11-8 
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It will be seen that, on the whole, practice effect at the 7-5 year level is 
not so much less than it is at the 9-5 year level. The big difference occurs with 
the A-groups when re-tested on the Reading Comprehension test. Whereas the 
pupils at the 7-5 level show no practice effect after the first re-testing, the pupils 
at the 9-5 level show considerable practice effect with each re-testing. Probably 
one reason why this occurs with the RC test and not with the GW test is that 
the former, unlike the latter, has a time limit and this is likely to be to the 
advantage of the brighter pupil at the second and third re-tests. 

(iii) Practice effect (Groups E and F). The difficulty in estimating practice 
effect in going from one form to the other of the Reading Comprehension test is 
that the two forms are not equally standardised. This difference in standardisa- 
tion becomes obvious when we consider the mean scores obtained by the E and 
F-groups on the two forms of the test (see Table 7). From these mean scores 
we can derive mean gains and mean standardisation differences (shown in the last 


two columns of Table 7). It will be seen that the discrepancy in standardisation 
increases with reading level. 


TABLE 7 


MEAN GAINS AND STANDARDISATION DIFFERENCES DERIVED FROM 


E MEAN Scores or E 
AND F-GRoUPS on RCA anp RCB. - 


Mean score 
School (months) Mean gain | Standardisa- 


(months) _ | tion differenc: 
RCA A-B (mths.) 


98-4 


106-0 2 i Mid 


104-4 


122.7 


114-6 
123-2 


a periment } E 
and practice effect. There i Ti r hoc hypot eses 
1 1s, € efore, no need for the ad 


We 
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TABLE 8 5 


s Mape Bv E AND F-cnoups on Two FORMS OF THE READING 


DIFFERENTIAL GAIN 
COMPREHENSION TEST. 


Gain (months) 


9-5 year level 7-5 year level 


6-6 2:5 
114 6-8 


4-6 


2:8 


4:2 


In our earlier study, we included statistical regression as one of the factors 
contributing to the gain of our retarded pupils. The gain due to regression is not 
iust a function of the reliability of the initial and final tests—as Birch would 
ines it—but of the correlation between the initial and final test scores. This 
correlation is affected not only by the errors of measurement (with which the 
reliability coefficients are concerned), but also by real differential changes in the 
performance of the children in the inter-test period (and by differences in test 
content where the initial and final tests are not the same). In the present 
investigation, the same tests were used initially and finally and the inter-test 

Regression effects were, therefore, found to 


eri re comparatively short. r 
r On the wholes the same would apply to our earlier study. 


IV.—CoACHING. 

A control group design enables the investigator to allow for such irrelevant 
factors as practice effect and regression, but in a remedial education experiment 
the net gains of the remedial group over the controls can still be inflated as a 
result of coaching. In our earlier study, we tried to demonstrate the coaching 

he ‘true’ (or net) gain for reading (as measured by 


effect by showing that t ; r j 
the Schonell RC test) ‘ disappeared ' when we applied an unstandardised test 


of different form and content. : 
Pringle and Gulliford threw doubts on this result and the conclusion we 


drew from it by criticising the validity of the unstandardised test we employed. 
But further evidence of a coaching effect was obtained when we re-tested both 
the remedial and control pupils a year after the finish of our experiment. The 
results are shown in Table 9. (The Schonell Graded Word test was included 
since our omission of this test from our experiment had been criticised.) 


* There is one exception. The large gain in going from Form A to Form B of the RC 

st, which we recorded in our earlier investigation, was spuriously high because of a differ- 

Un th standardisations which we employed for the two forms of the test. Instea d of 

DE. rai of 7-10 months, which we recorded, we should have obtained something of the 
etw 4 months (as in the present investigation). 
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TABLE 9 
MEANS FOR REMEDIAL AND CONTROL GROUPS (12 MONTHS AFTER EXPERIMENT IN REMEDIAL 
i EDUCATION). 


Remedial Group | Control Group Remedial Group Control Group 


Test (test-selected) (teacher-selected) 


yrs. months yrs. mths. 


S; yrs. mths. 
RCB 9 10-3 9 9-5 E x 9 4:3 


GW 9 8:9 10 1-0 E 9 94 


The differences between the remedial and control groups are quite non- 
significant statistically. The remedial pupils, therefore, failed to maintain the 
advantage in reading which they had shown at the end of the experiment.* 


The transitoriness of the gain of our remedial group we would regard as evidence 
of coaching. 


It will be seen that in our experiment 


and its follow-up we have used two 
criteria to detect the presence of 


coaching. 
(a) Coaching is present when gains vary considerably with the type of test 
used to measure the basic skill to b 


€ acquired. Such variation in gains we would 
regard as evidence more of a vari 


T ation in the acquisition of test-skills than of a 
genuine improvement in the basic skill. 


(b) A gain can be attributed t 
disappears in a fairly short period of time—12 months or less. 

Our second criterion represents a broadening in our conception of coaching 
as expressed in our original article and conforms more to the generally accepted 
idea of coaching. 

Substantial confirmation of our results has been obtained by Dr. J. E. 
Collins, a former research fellow of this department. Collins not only repeated 
our experiment on a larger scale, but also carried out a follow-up of twenty-four 
cases who had been treated at the Remedial Education Centre. The results of 
his experiment were identical with our own. Gains varied with the type of test 
used and were quite transitory, disappearing in three to twelve months. His 
follow-up of twenty-four past Centre cases also revealed a complete imperman- 
ence in the gains achieved by the Centre. The data which he obtained for these 
pupils are shown in Table 10. 


o coaching when it is transitory, i.e., when it 


TABLE 10 


Twenty-Four Past CENTRE CASES—INITIAL, FINAL AND FOLLOW-UP TEST RESULTS. 


Da 
ftem. ita on 


Data on 
entering Centre 


leaving Centre Follow-up data 


mths. yrs. mths. 


3 10 — 10 12 3 
Mechanical Reading 11 10 13 3 
Reading Comp : 9 8 10 6 
Spelling ..,. 9 10 10 5 
Mechanical Arithmetic E $ 1 2 


* The same re 


at the time we were by W. K. Gardner 


» one of our Diploma students, who, 
tested the remedial pupils of the 
ardner did not choose his control 
lace (on the basis of the data of the 
t each year), there was no question 
trols (as Birch had suggested). 


Pupils until a year i b 
Pa sume dL li struction had taken pl 
of the cl. 
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It would appear, therefore, that remedial education, at least as practised 
at the moment, has no permanent results. The gains which have been claimed 
for it are a product largely of practice effect and coaching. In other words, it 
accomplishes little more than is achieved by ordinary classroom teaching. To 
quote Collins, it represents an overlapping situation without essential change. 


VI.—APPENDIX. 
(a) Method for estimating differential practice effects when pupils ave ve-tested on the 
same test. 

The methods used by Peel? and by Wiseman and Wrigley® are suitable only 
with large samples of pupils. By combining our school groups we might have 
obtained a large enough sample, but a necessary condition for this combination is 

and initial scores should show no difference between 


that a covariance analysis of final 
We were, therefore, obliged to obtain 


schools and this did not hold in our own case. 
differential practice effects for each school separately before averages for the four 


schools could be found. 
Since our school g re small, there was no possibility of establishing 
any non-linearity in the differential practice effects (as happened in Pecl's investiga- 
tion), but this did not matter since non-linearity was likely to be slight and to occur 
mainly at the higher levels (in which we were not interested). 
Our method was simply to derive the equation of the ‘ equivalence line ' for the 
final and initial scores, that is to say, the line showing score (Y) in the final test as à 
in the initial test. The equation of this line is 


function of score (X) i tia eq 
Y -Y=9, /o, (X —X) 


e means and standard deviations of the initial and final 


roups were 


where X, Y, Gx0y are th 
for the school group. 
be written in the form : 


Y—b,X4-c, 
Y —o, |o, X 
actice effect, the rélation between Y and X would be 
t the differential practice effect P for any X is given 


scores, respectively k, 
This equation may 


where ^e—o, /o, and c,— 


Then, if there was no pr 
simply Y=X. It follows tha 
by the equation " 

P= (b. —)X --c, 

In our investigation, P was found for X—114 and 90 months. It is these values 


which appear in Table 6. 


(b) Method for estimating diffevential practice effects for two tests which are unequally 


standardised. 


In this case, it is necessary to have two groups in each school, one group receiv- 


ing test (A) first and test (B) second and the other group receiving the tests in the 

reverse order. In our investigation, we denoted these groups by the letters E and F. 

In the method we adopted, it is necessary to assume that while the two tests 

1 standardised there is a linear or approximately linear relation between 

equivalent ages as obtained by a school group on the two tests. This line we shall 

call the ' true ' equivalence line. We must also, as in (A), assume that the differential 
ractice effects are linearly related to level of ability. 


Our procedure was then as follows : 
(i) The equation of the equivalence line for the (A) and (B) scores (in months) was 


obtained for both the E and F groups in each school. Then, if X and Y denote the 
(A) and (B) scores, the equivalence lines could be written in the form : 
Y=b,X4+4 (Group E)—(1) 
Y=b,X+¢y (Group F)—(2) 


are unequally 
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(ii) The ‘true’ equivalence line lines somewhere between (1) and (2. If we make 
the further assumption that the amount of practice effect (in months) in going from 
(A) to (B) is the same as that iu going from (B) to (A), it is easy to show that the 
equation of the ‘ true ’ equivalence line is : 
b,(1-+b,) Daci + Ca 

2 X 4 2 

HE T 
(iii) By subracting (3) from (1), we obtain the differential practice effect P for any X. 


b,—b&. | Ci —Cy 
iui Die 2 
CTE Ee C 


Using (4) 
Table 8. 


It will be noted that, in all cases, differential practice effects were obtained 
relative to scores on Form A, Considerable divergencies between the published norms 
for (a)-and (b) have long been apparent to users of these tests. Re-standardisations 
are lengthy and expensive, but by the application of equivalence-lines to the present 
data, we have obtained a Te-standardisation for which we claim a high degree of 
onsisted of (i) the use of the ‘ true equivalence line ’ 
for tests RCA /RCB (described above) to zd de any score on one zh these tests into 


Èz 


we found the differential practice effects (differential gains) shown in 


sation of the graded word test is very generally 
E en as a basis, and new norms were read off for RCA 
and RCB. The former were for ree closely with the published norms, but 
i 5 IScrepancies, The new norms for Form B which we 
derived by this method are sh. in Table IT. These are for both sexes together, 
€ norms for boys and girls (as Schonell does for RCA 


ough—. ~ 4 i = 
ing the small adjustments Shoe for GW) this can be done by adding or subtract 


TABLE 11 


Rev 


ISED Norms y ^q Fit 
For Form B or SCHONELL's READING COMPREHENSION T 


Score | Reading Age || Score | Read: | 
Yrs. SM s Reading Age || Score Reading Age || Score | Reading Age j 
TS. Mths. Yrs. Mths. Yrs. Mths. 
"in ag B 22 9 1 22 11 3 32 3 0 
23 32 | ig 
4 vw il te 4 | 23 5 | as 2 
5 5 15 7 24 7 34 4 
6 8 16 9 25 10 35 6 
7 11 19 0 26 12 0 36 8 
8 & "A OH 2| 27 ^ 2] 37 9 
9 5 19 5 28 4 38 11 
10 8 20 4 29 6 39 14 0 
11 ET | ay 9d En 8 || 40 1 
10 41 2 
Sex Adjustment 
For scores 3 to 7 and 24 
24 to 4] y 
v one month from Reading oye) add one month to Reading Age; (Girls) subtract 
Scores 8 to 23, inclusive Boys) ^ 
two months from Radis e i two months to Reading Age; (Girls) subtract 


CuRR AND GOURLAY 167 


VII.—REFERENCES. 
1 Curr ,W., and GounLav, N. (1953). An experimental evaluation of remedial education 


Brit. J. Educ. Psychol., 23, 45-55. 
? Bircu, L. B. (1953). Comments on ' Àn experimental evaluation of remedial education,’ 


by Curr and Gourlay. Brit. J. Educ. Psychol., 23, 56-57. 

3 KELLMER PRINGLE, M. L., and GULLIFORD, R. (1953). A note on ' An evaluation of 
remedial education.’ Brit. J. Educ. Psychol., 23, 196-199. 

4Curr, W., and GOURLAY, N. (1956). Differences between testers in Terman-Merrill 
testings. Brit. J. Stat. Psychol., 9, 75-81. 

5 PEEL, E. A. (1952). Practice effects between three consecutive tests of intelligence. 
Brit. J. Educ. Psychol., 22, 196-199. 

¢ Wiseman, S., and WRIGLEY, J. (1953). The comparative effects of coaching and practice 
on the results of verbal intelligence tests. Brit. J. Psychol., 44, 83-94. 

? Corriws, J. E. (1956). The Effects of Remedial Education. Unpublished Ph.D. Thesis, 
Birmingham University. 


(Manuscript received, September 28th, 1959.) 


IIIL—PanT-TIME CLASSES FOR YouNG BACKWARD READERS. 


By H. H. HILLMAN anp R. L. SNOWDON 


(Handicapped Pupils’ Section, Education Department, County Council of 
Durham.) 


Summary. 1.—A scheme for the remedial teac 
to determine whether part-time teachers worl 
ward children could significant] 
pupils. 

2.—130 children of junior school age had been tested 
scheme by April, 1958. 
' phonic ' approach. 


3.—Comparison was made between the progress of backward children who 
attended remedial classes and children similarly handicapped who did not. 


hing of reading was established 
king with small groups of back- 
y improve the reading achievement of their 


and taught under this 
The remedial teachers adopted a predominantly 


_, 4.—The scheme was found to be acceptable to the schools from which the 
children came. 


oy results appear to be roughly comparable with the results of similar 
(odmore elaborate) remedial schemes and with the results of teaching retarded 


ading to junior school children were 
by the Education Committee. The 
d guidance clinics of work which was 


such pupils are due in alm 
There were four 


Shite : 5 (as they were called) at each of two 
"elio AL €rs from nearby schools. Each class con- 
twenty-six schools were dealt wi twice a week. A total of 130 pupils from 
Teaching was lasd i^ ealt with during the period described in this article. 
Sly by qualifie -time teachers, principally married 

training in remedial end ol experience, but otherwise with no special 
: s m accordance with the hypothesis 

ed. One full-time teacher was also employed 


under the « E 7 
e scheme and was responsible for the selection of children and deciding 


when attendance should st 
Children between 8 and l| 

. s a 1l years ol 

T they Were more than two Beds bim p 

their age, provided that they had not previ : 

ally sub-normal. It was not i x ep 

sub-normal to be adm 

date of entry. 

although reading to th, 

be retarded far [fos tie i Lo ure ua egere is 

e ede me attainment leve] of Which he or she was capable. The 
en were not exclusively children with attainment far 
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below I.Q., nor exclusive of many children below average inintelligence. What 
we offered these children was more individual teaching than is possible in most 


primary schools to-day. 


II.—TESTING AND TEACHING IN PART-TIME CLASSES. 

Selection for admission was made after each child had been fully discussed 
with the head teacher and class teacher concerned. No intelligence test results 
were available (except in the case of eight children referred by educational 
psychologists of the School Health Service), but Burt's Graded Vocabulary Test 
of Reading Accuracy! was administered to each pupil for the purposes of 
selection and the measuring of progress. This standardised test was chosen 
because it was considered that in the final analysis reading achievement springs 
from independence in the recognition of unknown words which itself depends 
upon developing phonic ability. The Burt test, though far from ideal, appeared 
to be more closely related to the growth of phonic knowledge than any other 
test on the market at the end of 1956, when the preliminary tests were given. 
The average amount of backwardness in reading, on entry to adjustment classes, 
as measured by the Burt test, was three years two-and-a-half months. Similarly, 
a pupil's readiness to leave was decided jointly by the adjustment class teacher 
and the child's own teacher and head teacher. In reaching this decision, the 
full-time teacher in charge of adjustment classes played the part of co-ordinator, 
visiting the schools and providing a Reading Age obtained with the Burt test. 

Each remedial session was from two to two-and-a-half hours long and quite 
obviously children could not be expected to devote their whole time to reading 
only. The only request made of the teachers was that class activities should 
invariably be related directly to the reading improvement of each child. It 
was hoped that within the scope of this request each teacher would be free to 
develop her own ideas and orientate her instruction to the age and interest level 
and particular reading difficulties of the individual child. The basic reading 
series provided for each centre were Emily Smith's Big Ben Readers and Daniels 
and Diack's Royal Road Readers. In addition, there were a large number of 
supplementary reading books of which Gertrude Keir's Adventures in Reading 
series proved very useful. None of the seven teachers at work during the 
experimental period used methods which differed markedly from ordinary 
classroom procedures. All, however, adopted a consistently ' phonic ' rather 
than a 'whole-word ' or "sentence-method ' approach. It was adjustment 
class policy to respect the independence of the teachers' professional opinions and 
this agreement on method between organiser and teacher was fortuitous, not 
contrived. A wealth of materials (plasticene, coloured pencils, scissors, card, 
and so on) for the construction of reading apparatus and for related activities 
was available at each centre. This was used extensively by the teachers to 
enlarge and reinforce the reading and writing experience of their pupils and to 
vary their own rather formal ' phonic ' techniques. 


III.—REsurrs. 


The success of this type of remedial teaching as a means of overcoming 

backwardness in reading was measured primarily by the results of the Burt 
able 1). 

= re raus improvement in reading ability of eleven months for children 

ended an adjustment class for one term, to two years five months for 

attended for three terms, would appear to bea satisfactory outcome 

: d the following qualifying factors. Five 


ing bearing in min: : 
m Mori testing in the groups represented in Table | 


who att 
children who 
of remedial 
children who were 
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TABLE 1 


IMPROVEMENT IN READING ABILITY OF CHILDREN ATTENDING CLASSES. 


Improvement after | No. of Children Least Greatest 
Attendance FOR Tested Improvement | Improvement 


Average 
Improvement 


One term 


3 yrs. 6 mths. 11 mths. 


3 yrs. 24 mths.| 1 yr. 7 mths. 


Three terms 3 yrs. 1 mth. | 2 yrs. 5 mths. 


were regarded as being unsuitable for this type of instruction. Of one of these 
children his head teacher remarked : “ (He is) emotionally unstable and with a 
speech handicap is making slow progress in all subjects. I think he ought to be 
interviewed by a psychologist with a view to special school training." It was 
recommended to head teachers that these children should be reported to the 
School Health Service so that their disabilities might receive further investiga- 
tion. So far as this affected the results shown in the above Table 1, entries in 
the column headed ‘ Least Improvement ' represent the achievement of one or 
other of these five children in each case. The results obtained from these 
children were, of course, included in the populations from which the averages 
of improvement were calculated. The test results from which these averages 
were obtained were all provided by children with a record of over 60 per cent. 
attendance at an adjustment class. Their average attendance was 84 per cent. 
; i à cient how that the children’s reading made some 
improvement: m order to justify this special educational provision, since it 
could be claimed that pupils would have progressed equally well if left in their 
pW a ad In an attempt to meet this point, the following two studies were 


(i) In Table 2 are shown the Tes ri E 
i " ; A e ults of a comparison between the average 
of viii € reading ability of two groups of children matched on criteria. 
arity of age, sual, socio-economic background and degree of backward- 
i € up of twenty-five children who spent at least two 
terms (Spring and Summer Terms, 19 7) in ordinary junior schools, and in some 


lo i A ; € 
five diden cf Cie ai thout attending an adjustment class. The twenty- 


Spent two terms in ordinary junior schools but, in 
TABLE 2 


COMPARISON W 


ITH CONTROL GROUP, 


MATCHED GROUPS 


(25 children) dun Average Average 
Average Age Retardation Improvement 
| after two terms 
Group A 


(In own Schools) 9 yrs. 6 mths. 


3 years 4 months 
Group B 
(In Adjustment 
MASS) © 252 0.2 9 yrs. 7 mths. 3 years 


l yr. 11 mths. 


e—a ————— S ÓÓ 
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addition, they travelled twice a week throughout this period to an adjustment 
centre for part-time classes. The children in both groups were tested with the 
Burt Vocabulary Reading Test at the beginning and end of the specified period. 
A test of the significance of the difference between the means of these two groups 
indicated that this result (S.E.,—0-1219) is significant above the 0-01 level. 

(ii) As a further check on this finding, however, a different type of control 
group was used. In this study the average improvement in reading ability of 
twenty-one children who spent two terms in their own schools without adjust- 
ment class help was found to be five months, compared with an average improve- 
ment in reading age for the same children, after attending adjustment classes for 
two subsequent terms, of one year ten months. These two means were tested ior 
significance and were found to be such (S.E.,—0-154) that it is most unlikely 
the difference between them occurred by chance. 

So far attention has been directed to a statistical analysis of the achieve- 
ments of adjustment classes. In practice, however, the success of the centres 
was measured by many factors which cannot be represented numerically. In 
particular, the interest and active co-operation of the children themselves, their 
parents, head teachers and class teachers were evidence of the value of this type 
of remedial teaching. It is insufficient to provide a specialised educational 
service which is validated by statistical proofs, unless the head teachers who are 
free to avail themselves of this service are of the opinion that it performs a 
necessary function and that the means chosen are both practical and efficient. 
In particular, the willingness of parents to allow their children to attend 
adjustment classes depended entirely on the good relations established by the 
head teachers of the children’s own schools. Considerable importance may, 
therefore, be attached to the fact that of the twenty-seven head teachers con- 
cerned, all expressed favourable opinions of the work of the classes and none 
contrary. ep 

Various education authorities, as well as individual investigators, have 
undertaken researches into the value of remedial class teaching. Despite this, 
it was difficult to find reports of this character which were directly comparable 
with the present report. For interest, however, several comparisons were made 
with work by other teachers and it is contended that the rate of improvement in 
Durham compared very favourably with rates obtained elsewhere, e.g., Man- 
chester,? Schonell, Birch,* and Miss Friedmann's retarded children.* 


IV.—CONCLUSION. 


The relevance of these results to those administrators and teachers con- 
cerned with backward readers in our junior schools appears to us to be two-fold. 
First: we conclude that remedial classes, economically organised as described 
above and staffed by part-time teachers without specialist training, were able, 
in Durham County, significantly to reduce the reading backwardness of the 
majority of children referred to them. Although it cannot be stated that better 
results might not have been obtained by a more elaborate provision, it would 
seem that Durham results bear comparison with published reports of experi- 
ments both similar and more complex. Second: we conclude that there is 
some indication that a more individual approach to ordinary backward readers 
hool age can produce educational results apparently as good as a 


of junior sc : 
similar approach to a selected group of retarded children. 
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IV—Tue Errects or REMEDIAL EDUCATION ON YOUNG 
CHILDREN’S READING ABILITY AND ATTITUDE TO READING. 


By J. DUNHAM 


(Summary of thesis approved for the degree of M.Ed. in the University of Manchester, 
May, 1959.) 
AIMS. 
The main aim of this investigation was to obtain experimental evidence about 
the effectiveness of remedial education. Evidence of this kind was thought to be 
particularly valuable because special provision for remedial education and criticism 
of remedial education have both been increasing recently in this country. 
Two criteria of effectiveness were used : 
(a) Improvement in reading ability. 
(b) Improvement in attitude to reading. 
The question which the main part of the investigation sought to answer was : What 
are the effects of remedial education on young children’s reading ability and attitude 
to reading ? . 
The thesis also contains some findings on the following controversial topics : 
(i) The progress of retarded children not given remedial education. 
(ii) The selection of children for remedial education. 
(iii) The use of non-verbal tests of intelligence in selection. 


METHODS. -— 
The remedial group (A) consisting of twenty severely retarded 9-year-old 


children of about average intelligence, took part in remedial reading lessons once 
or twice weekly for six months, amounting to about twenty-two hours of remedial 
work. They were paired, child for child,with a control group of children on the 
waiting list for remedial education, according to age, intelligence, reading ability 
and the socio-economic area of the school. The remedial children worked in small 
groups of not more than six as part of the normal work of an L.E.A. Remedial 
Education Service. 

Reading ability was measured on the Burt (re-arranged) Word Recognition 
Test. Retardation was measured by the reading test, the N.F.E.R. Non-Verbal 
Test 5 and the Verbal Scale of the W.I.S.C. (U.K. version). Attitude to reading was 
measured on a Thurstone-type attitude scale, prepared by the writer. 

The ' real ' gain in reading of the experimental group was the difference between 
the mean gains of the experimental and control groups. This was the increment 
credited to remedial education. Similarly, the ' real ' improvement in the attitude 
to reading of the experimental group was the difference between the means of the 
improvement in attitude to reading of the experimental and control groups. 

In order to investigate the prediction of success in remedial education, because 
it is an important aspect of the problem of the selection of children for remedial 
work, the reading gains of the children of the experimental group were correlated, 
using Spearman's rank order formula, with their R.A., C.A., 1.Q., and Attitude 
Score at the beginning of the investigation and with their changes in attitude during 
the investigation. Es 

To compare the efiect of using à verbal with that of a non-verbal test of intellig- 
ence, in the selection of children for remedial education, the respective results of the 
experimental group, when tested on the N.F.E.R. Non-Verbal Test 5 and the 
W.I.S.C. (V) were studied. 

To assess changes in attitude to reading, an attitude scale constructed by the 
r in an earlier research was used after some modification and improvement. 
‘This scale was of the Thurstone type; care was taken in its construction to ensure 
that the more ambiguous items were excluded, and that the remaining items were 
fairly evenly spaced by scale value. Three practice statements were added to assist 
the children to understand the procedure. The scale values of the twenty items 
finally selected ranged from 0-4 to 10-4, with high SV representing a favourable 
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attitude to reading, and the attitude score was the mean scale value of the statements 
with which the child agreed. The test-re-test coefficient of reliability of the scale 
was found to be +-77. The validity coefficient of the scale obtained by a comparison 


with a class teacher’s rank order was +-:59. The scale was administered individually 
and orally. 


RESULTS, " 
1.—Improvement in Reading Ability. s 


At beginning At end 


Group No. of Mean | “Real” 


children LA. I A A. Gain Gain 
(months) | (months) 


11-0 6:4 
4-6 — 


The ' real’ gains were significant at the 1 per cent. level. 


2.—Improvement in Altitude to Reading. 


Mean Attitude Scores 


Mean " Real" 
Group p_e Improvement Improvement 
At beginning at end in Attitude in Attitude 
A +68 T76 --0-8 +05 
B T73 


+7°-6 +0°3 


The ‘ real’ improvement in Attitude to reading was not significant. 


3.—The Selection of Children 


The followin, 
obtained : 


for remedial education. 


€ rank order correlation with the reading gains of Group A were 


og i) sna Extent of retardation ...... 
Rein aiteis Attitude Score .........,., 


= A 
None of the correlations was significant. 


4.—Comparison of the test results 


Test 5 and the WISA tv) of Group A when tested on the N.F.E.R. Non-Verbal 


Test Mean Range of | Rho. with 
disc Retarda- | Retarda- Reading 


tion tion gains 
ARBRE xn 3:6 3:10 +0-21 


WISC (V) .. +0-22 


Mean I.Q. when tested on 


thi E 
cent. level) than mean LO. wh p DAE 


test was significantly higher (at 5 per 
en tested on the verbal test. 


“i 


«^ 
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SUMMARY OF CONCLUSIONS. 

_—There was evidence that remedial education, as organised in this investiga- 
was, on the whole, effective in improving the reading ability of severely 
of about average intelligence. 

favourable effect of remedial education, as 
organised in this investigation, on the attitudes to reading of the same children was 
much less than, the effect on reading ability. If this finding is reported from other 
investigations it would appear to suggest that more research ought to be directed 
to the problem of getting the retarded reader to approach the reading situation in a 
wholesome way, particularly when his home and environmental conditions seem 


very unfavourable. 
3.— The finding that in the control group some children made very little progress 


in reading and some even lost ground, suggests their urgent need of remedial educa- 


8 
tion. But, in fact, that some reasonable gains were also recorded suggests the need 
for research to attempt t 


o formulate satisfactory selection procedures so that 
retarded children who would probably make progress without it are not given 
remedial education. 

4.—The need for further research into selection procedures is also suggested by 
the finding that the reading gains of the experimental group were associated at least 
as closely with improvements in attitude to reading as with IQ 

5,—There was evidence that, on the whole, the test scores of the children of the 
remedial group were significantly higher, when tested on the N.F.E.R. Non-Verbal 
"Test 5 than on the Verbal Scale of the W.LS.C. This suggests that their general 
level of verbal ability was on the whole, lower than their general mental ability 
level. Because of the reported close relationship between reading and verbal or 
language development, particularly in the early stages of learning to read, this find- 
ing appears to support the suggestion that retardation should be diagnosed as a 
discrepancy between the general level of verbal development (as shown by a verbal 
or ` language ' L.Q. or Mental Age) and the slower growth of reading skills. For this 
the W.LS.C. (V) would appear to be a more appr 


opriate test than the N.F.E.R. 
Non-Verbal Test 5. On the other hand, there has been no evidence that the 
W.LS.C. (V) would be more useful in predicting success in remedial education than 


the N.F.E.R. Non-Verbal Test 5. 

6.—Evidence is presented that a Thurstone-type attitude scale, when admin- 
istered individually and orally, can be successfully used to measure the attitude to 
reading of 9-year-old pupils retarded in reading. As attitude scales have been used 
almost entirely with secondary school pupils, this may be a useful finding. 


tion, 
retarded, 9-year-old children 


2 — There was evidence that the 


V—A STUDY oF MOTIVATION IN REMEDIAL READING 


By GEOFFREY R. ROBERTS 
(Summary of M.A. thesis, University of Wales, 1958.) 


NE C oe motivation, each combined with a suitable method of teaching 
children to read, were compared under normal classroom conditions, „The experi- 
ment, which lasted 6 months, was carried out twice. On the first occasion, eighteen 
children took part, divided into three groups of six; and on the second occasion 
thirty-three children took part, also divided evenly. However, nine of these 
thirty-three had taken part in the experiment once so their results at the second 


, the numbers were twelve 
dC. Twenty weeks, with 


roughly balanced groups according to their general reading ability. 
Initial Means : 
Group A—CA. 10y. 2m., Range 9y. 8m. to 10y. 9m. 
LQ. 98, Range 70 to 114, SD.— 13-7. 
RA. 8 y 6m., Range 6y. to lly. 
Group B—CA. 10y. 2m., Range 9y. 4m. to lly. 
I.Q. 100, Range 80 to 127, SD.— 13-2. 
RA. 8y. 5m., Range 6y. to lly. 2m. 
Group C—CA. 10y. 2m., Range 9y. 4m. to 10y. 11m. 
I-O. 104, Range 89 to 122, SD.=9-8, 
RA. 9y. 2m., Range 7y. 2m. to 12y. 2m. 


2.—DESCRIPTION OF THE THREE METHODS, 
METHOD A : The basis of motivation was curiosity and creative drive, and the 

method devised by Grace M. Fernald was used, with a few minor alterations. The 

first step was to restore the children’s confidence in their ability to learn (recondi- 

tioning). The power of this method was d i 

choose a difficult word. It was written on a 


; pronouncing its component parts simul- 
taneously, e.g., im-port-ant. Individual letters were not sounded. The tracing and 
vocalization continued until the child could write the word without copying ; then 
all the children were invited to try their Skill in 

The main task of writing stories and reading them now began. The child chose 
ide of each piece of paper. 


n a personal word book and occasionally was tested 
on them. When the Story was finished, i ^ Aithin 


n's st 4 i 
was very popular. The children illustrated their stories. If anyone Wished du a 
he could choose any available book arnt as above 
METHOD B: Motivation depended up 


$ advocated by A. E. 
It consisted of (1) a word omichment 


programme ; (2) co-operative reading; (3) individual reading ; and (4) incidental 


phonic analysis. 
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ramme was planned and occupied a third of the time. 
It took the form of non-competitive games and puzzles. These included : (1) finding 
the missing letter or word ; (2) finding the word where the letters have been jumbled 
up; (3) selecting words relating to 4 particular topic; (4) re-writing sentences 
where the words have been jumbled ; (5) learning a list of words; (6) crossword 
puzzles. Exercises Were given which included : (1) writing words relating to a 
topic; (2) finding words beginning with a given letter (no dictionaries) ; (3) match- 
ing words and sentences to pictures ; (4) matching words and sentences printed on 
cards against the same word or sentence ina bundle of cards or in à book. No formal 
lessons were given, but the essential points of the games and exercises were 
emphasized. " 

Co-operative reading included : (1) reading aloud in pairs or to the teacher ; 
(2) a more advanced reader helping à less advanced reader ; (3) reading in groups 
(soon discarded as futile). 

‘The remainder of the time was allocated to individual silent reading. The 
children worked through à graded list of 146 books, beginning On 2». appropriate 
book. Some inci sis of new words was employed. The sound and 
form of the word was pointed out, but the child was left to employ his own method 


of learning the word. 
: The basis for the motivation of Group C was competition. All the 


exercises and much of the individual reading were on a competitive basis and a 
system of awards by stars was used, with a prize at the end of each term. 
< method: was used, based upon lessons designed to 


An essentially phonetic n 1 
reading errors classified by Marion Monroe as: faulty vowels, faulty 


eliminate mAH d ei 
consonants, reversals of letters and sentences, additions and omissions of sounds 
f words, repetition, and refusal to attack a word. Monroe's 


and words, substitution [9 
basic approach was followed, though her method was not used in its entirety. A 
ach child showed a general tendency towards greatest 
der ' faulty vowels and consonants, and reversals, and prominence 
ection. 
Method C consisted of (1) Lessons and drills of deficient elements ; (2) Competi- 
tive games and exercises connected with these lessons ; (3) Individual reading of the 
zroup B with competitions in oral reading and word recogni- 


graded readers used by Group P» 
ere given and great emphasis was placed on all sounds, 


tion. Formal lessons W g a 
especially of letters. irst, the various vowel sounds were dealt with and then the 
ty’ Next came the consonants and then common reversals. Blends 
of two and three letters, silent letters, and finally common word-endings, e.g., tion, 
were illustrated. Other deficiencies were corrected individually. Formal instruc- 
tion in each lesson lasted for five to ten minutes. Competitive games and exercises 
d thinking of words, finding words ; underlining specific sounds ; 
placing words in sentences, and pronouncing words. Some games were not neces- 
sarily connected with the lessons and included crosswords, finding the missing letter 
or word, etc. Indivi eading occupied a little under half the time spent on this 
experiment and stars were given for good and improved reading. 


3. 


RESULTS. 

Two tests were used—Sleight's Non-Verbal Intelligence Test and Schonell's 

Graded Word Reading Test. Both tests were given at the beginning of the experi- 

ment and the reading ‘test was given again after ten weeks and finally after twenty 
a minimum. 


weeks. Testing was kept to ? i 
The mean improvement in Reading Age for each group was: Group A-—15:7 
months, SD.—6:5; Group B=7:9 months., SD.—42; Group C105 mons 


SD.—5:2. 

The differences between the means, when tested by Sir Ronald Fisher's small 

sample ' statistical technique, showed that the results of Group A were significantly 

better than those achieved by either Group B (t=3-66, P= 0-2 per cent.) oF 

Group ges DLT yk a <5 per cent.). The results of Group C were not significantly 
eof Group B (t= 1:45, P=about 20 per cent.), although with slightly 

{ children it appears probable that they would be. 
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1 s making 6 months improvement or less were : Group A—1 out of 

125 [uses i ien of 16 ; Group C4 out of t The numbers making more than 

r i vement were 8, 2 and 4, respectively. . i 
a pee was made between the M.A. and R.A. of each child at the begin- 
ning and end of the experiment. The mean reduction in retardation in reading was : 
Group A—9-6 months; Group B=1-8 months; Group C—4-1 months. The 
differences again showed the results of Group A to be significantly superior to both 
Group B (t—3-63, P= <0-2 per cent.) and Group C (t—2-25, P= 75 per cent.), with 
Group C again not quite significantly better than Group B (t— 1-30, P—about 90 
edd p Method A was shown to be very much the best method, and Method B the 
ism correlations between initial R.A. and the amount of improvement shown 
during the experiment showed Group A= —0-3; Group B= +0:2; Group C=+0-1, 
Although not one of these correlations by itself is significantly different from Zero, 
the difference between the correlations coefficient for Group A and the other two 
coefficients shows a tendency towards significance. Thus, Method A tends to 
favour the poorest readers more than either B or C does. 

In Method C it was the best readers at the start who showed the greatest success 
in the competition (correlation coefficient of initial R.A. and numbers of stars 
gained— +0-9, and success in the competition was not related to the amount of 
improvement (correlation coefficient 4-0-2). 

Those children who had formed Groups B and C during the second occasion of 
of the experiment were transferred to Method A for six weeks. The mean improve- 
ment in R.A. for each group of eleven children dur ng this short period was: 
Group B=5-9 months, range 0 to 12 months, S.D. 7; Group C=5-6 months, 
range 0 to 19 months, S.D.— 4.7, 


4.— DISCUSSION AND CONCLUSIONS. 

The results show that Method A is 
methods as a means of teaching reading t 
and 11 years. The means of implementin 
Írom the underlying motivation, therefor 
their relative merits. 

The advantages of Method A are : (1) It is concerned with the interests of the 
child. (2) It increases self-confidence and self-esteem through writing, reading, and 
displaying their work, (3) By actively engaging the child, it prevents reading from 
becoming Passive and purely mechanical. (4) Reading and writing are combined 
naturally, which increases the fluency of both. The children read their own stories 


they read other stories, (5) The pre- 


undoubtedly superior to the other two 
© retarded readers between the ages of 9 
8 a method could not be separated easily 
€, it is possible only to give impressions of 


r ed | n à natural 
context and could be learned in association With a vivid experience. The method of 
learning words was simple and, whilst conforming to a single pattern, it incorporated 
visual, auditory, tactile and kinaesthetic means. (6) The child is induced to concen- 
trate upon the form of a new word sufficiently B enough for him to learn it 

Disadvantages of Method B: (1) Gregari : f 
co-operation. Rarely were all the children ir 
` ing tended to become the prerogative of the least retarded children 
dominate rather than help the others. (3) It is difficult to obtain a suit: 
books for the older retarded child, (4) Following a scheme of graded re; 
the child in reading some books of little or no interest to him. (5) The children were 
always keen to change from reading to the word enrichment programme, although 
it failed, compared with the other methods, because it Was remote from th his 
reading and the transfer of words learnt in this way was a slow process ee 
Disadvantages of Method C - (1) The co ition failed in that it did not hel 
those whose requirements were greatest, even though the competiti web 
'arranged ' so that they could win Stars. Furthermore, the nee Sake “es a 


- (2) Learn- 
who tried to 
able range of 
aders involved 
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constant reminder to them of their failure. (2) Undue anxiety, shown by fidgetting, 
pencil biting, and facial contortions, was aroused in the competitions. Even so, the 


he competitive exercises, which suggests that the act of 


children were keen to do t 
' doing ' was a stronger source of motivation than the competitive urge. (3) The 
the approach to 


children easily became bored during the formal lessons, because 
words was too technical, dealing with unintelligible segments and letter sounds, 
rather than the sound of syllables. 

General conclusions : A major cause of retardation was an inability to learn 
new words. Method B was the least effective in rectifying this, because, as in Method 
C, the word enrichment programme was separated from the child's reading. Method 
B had the further disadvantage of allowing the children to attack new words in a 
haphazard manner. It left the child to find its own way of learning new words. 
Method C gave the children a set form of attack upon new words, but the presentation 
lacked the vivid meaningful associations of Method A. In Method C, words were 

t closely connected with something of immediate 


treated in isolation and were no ; 
interest to the child. In Method A, new words were a small obstacle in a large field 


of interest, whereas in Method C words became a huge isolated obstacle. 


Method A, in contrast to Method C, did muc 
and C did not possess the propensity for ' ego-involvement ' that Met! 
In Method A the motivation came from within the child, whereas in B and C it was 
externally contrived. The children of Group A were thinking and writing about 
things which interested them ; they could see their personal story book growing ; 
and they could choose their own books. But the children of Group B were too young 
to co-operate, and for Group C competition merely favoured the more able. Lastly, 
Method A is superior as an instrument of class instruction because it acts as a remedy 
to a wide range of deficiencies, it readily motivates the majority of children, and it 


holds their interest over a long period. 


OTHER SUMMARIES OF RESEARCHES REPORTED IN 
DEGREE THESES 


The Number Concepts of a Group of E.S.N. Children. 


By J. B. MANNIX 
(Abstract of thesis submitted for the degree of M.Ed. in the 


University of Manchester, 
September, 1959.) 


AIM. 


The aim of the research was to discover whether E.S.N. children behave 


similarly to or differently from the normal children tested by Piaget in the construc- 
tion of certain concepts of number and quantity. 


METHOD. 


Eight of the A get in The Child's Conception of Number 
were given to forty-eight E.S.N. children. The childen were selected to represent a 
range of mental age from 5 years to 9 years. Three boys and three girls represented 
each six-month period of mental age f. 


Quantities (with initial correspondence) p. 25 


2.— Conservation of Continuous Quantities 


ict zr re NOS: euis ni: eoe apparso Et wider CA p.4 
3.—Additive and Multiplicative Composition (Problem III, Measuring) p 222 
4.—Spontaneous Correspondence Singje ROWS: anini naasi p.74 
Beg Cupa) ComesPondence sce ne Tien Hime at p. 49 
6.—Co-ordination of Relations of Equivalence (Eggs and Cups) p. 204 
7 —Additive Composition of Number—Equating of Quantities p. 190 
8.—Additive Composition of Number Relations between Parts and 

NEROIN. iesus qoe aires cere, p. 187 
In giving the tests, Piaget's instructions wo. cay eee 


ere followed closely. ‘The tasks were 
andard questions, but supplementary 


individually. T information. The tests were given 


REsutts. 


perational "(Stage gies ponses were found which could be 


Likewise, ‘ Pre-Conceptual Sta ) Tesponses, according to Piaget's criteria. 

: 8e 1) responses w, i i ; p 
both these sta es, t i , P! 5 were identified in all the tests. At 
Piaget. E he children's responses were ver 


y similar to those quoted by 


tests, though only three of these s 
reported by Piaget. In the twi 
of an intermediate Stage, and in th, : 
group of second stage FTesponses, Evidence of ese Tae was a small and doubtful 
1s stage, upon which Piaget p i 
occasional. Get bases much of his h 
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Counting and Correspondence.—The E.S.N. children used counting to a greater 
extent than the normal children tested by Piaget, even when the test situation 
strongly suggested correspondence. Correspondence was used when counting was 
made difficult. The use of either was typical of the more advanced children, though 


counting was the preferred method. 
n Scaling of the Test Results. —Since some of the test results could not be classified 
into Piaget’s three stages of development, it was only possible to scale all the test 
results together on a basis of two categories— Operational ' responses (Stage 3) 
and ‘ Pre-Operational ' responses (Stages land 2). Guttman’s technique of scalo- 
gram analysis was applied and the responses were found to fall into seven scale types. 
The scalogram showed little evidence of disruptive factors and a reproducibility 
co-efficient of -94 was obtained. Though this figure met Guttman's requirements 


for ‘ scalability,’ it was known to be spuriously high because only two categories had 
been used. However, it was thought that near-scalability had been reached and 
that the following tentative conclusions might be drawn : First, the fact that a child 
is operational in one test does not imply that he is operational in others ; second, 
when the mean mental ages for each scale type are considered, no very close agree- 
ment is found between progression in the test responses and progression in mental 


age: 
Scale Type Mean M.A. 
S 


Completely Pre-Operational — 1 x A 
2 — 68 
3 — 79 
4 — 6:10 
5 — 72 
6 — 86 
7 — 71 


Completely Operational — 

t the mental age at which children passed into 

the stage of Concrete hild to child. It was possible 

to say, however, that no chi f less than 6-8 was at the stage of 

Concrete Operations in all the tests ; and that no child with a menta! 

than 6-5 was at the Pre-Operational Stage in all the tests. 
Relationship of Scaled Test Responses with Mental and Actual Age.—Spearman's 

dt 


The scalogram 


rank correlation was use o find the relationship between 
(a) Scaled Test Responses and Mental Age (P— 4+ +61). 


(b) Scaled Test Responses and Actual Age (P— + +52). 


CONCLUSIONS. . i 
1.—The behaviour of S.N. children in the construction of certain number 


E . : 
concepts is, on the whole, similar to that of the normal children tested by Piaget. 


2.— Test behaviour suggests that they pass through the three stages of develop- 
ment described by Piaget: 

3.—The period of development from the first to the third stage corresponds to 
rears to 8 years. 


a span of mental age of 9 Y 
4.—The mental age at which subnormal children pass from one stage to the 


next varies widely from child to child. 
5. —The development stage is not necessarily constant for the same child from 


test to test. 
6.—The absence of evidence of an intermediate stage may mean that such à 


e does not exist or may indicate a lack of discrimination in the test. 


7.—Counting appears to be the method preferred by E.S.N. children when 
dealing with a number problem, but correspondence may be substituted if the test 


situation makes counting difficult. 


stagi 
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A Study of the Reliability and Validity of the Children’s Apperception Test. 


Bv P. V. REDDY, M.A., Drr. REM.Ep. 
(Abstract of Ph.D. Thesis, University of London, Institute of Education, 1959.) 


INTRODUCTION. 
The Children's Apperception Test (C.A.T.) is a pro 


personalities. Research with this pictorial projective technique so far has been 
somewhat scanty, Up to the present moment no systematic attempt has been made 
to check the reliability of the test or to validate the interpretation of its responses. 
This investigation is aimed to study these attributes of the C.A.T. 


SUBJECTS. 

The subjects in this study consisted of one-hundred 6 and 7-year-old children 
selected at random from a junior mixed school. An equal number of cach sex were 
selected in each age group. In addition, nineteen male and six female deprived 
subjects whose ages ranged from 6 to 7 years were selected from two LCC: 
Homes for deprived children. 3 


TEST MATERIALS Uskp. 
(1) The Crichton Vocabulary Scale 


^ vay scale (1950) was used to measure the intelligence 
level of the subjects. The distribution of the LQs. of the three groups of subjects 
1s given in the table below. The 6 and 7-year-olds are referred to as Group I and 
Group IT, respectively, in the table. 


TABLE | 


DISTRIBUTION oF I.Os. or NORMAL AND DEPRIVED SUBJECTS ON THE CRICHTON 
VOCABULARY Scare (1950). 


LQ. Range n... CEU : — d Gopi Deprived Group 
M Mus N Female Male Female Male Female 
116—120 2 2 = 
111—115 2 1 1 l 2 1 
106—110 3 6 4 3 1 p 
101—105 9 2 4 g " 1 
96—100 5 5 6 A A 9 
91—95 4 9 6 6 2 
ee | 9 D 2 
Saini meow 25 2) p z 
103 a 3 19 6 
(2) Teachers’ Rating Scale. A rati . 
assess the personality Characteristic T is whe ined to enable teachers to 
The data thus obtai 


pupils who were selected for this study. 


as analysed 


ned i 7, 
be unduly complicated and too vs o1 Eae writer. Bellak's scheme was found to 


(4) Lydia Jackson's Test of Fi 


—— A —— 


a 
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Metuops or STUDY AND RESULTS. 

(1) Reliability. The reliability of the C.A.T. was approached from three 
aspects : 

(a) Test-Retest coefficient of the C.A.T. was obtained by re-administering the 
test to fifty 7-year-old subjects after an interval of three months. The correlation 


coefficient of sixteen personality variables ranged from 4-0:11 to 4-0-81 with an 
average of +0-45. 

(b) Score-Rescore reliability. 'To determine the re-score reliability of the C.A.T., 
records of the fifty 6-year-old subjects were re-scored by the writer six months later 
and the resulting coefficients ranged from --0-51 to +0°95 with an average of 4-0:82 
for all the scoring categories. 

(c) Reliability by the Matching Technique. Three judges were asked to match 
two sets of personality sketches drawn up from the test-retest C.A.T. records of 
eighteen male and eighteen female subjects. The seventy-two sketches were divided 
at random into six groups with six pairs in each group. A statistical analysis of the 
matchings was made and an average coefficient of --0-578 2:078 by Vernon's 
formula? was obtained. e 

(2) Validity. Construct validity 
applied. ; n " Sie » ] 

(a) Construct Validity. (i) Factorial Validity. The two sets of intercorrela- 
tions of the C.A.T. personality variables were separately analysed by Thurstone's 
(simple summation) method and three bi-polar factors were extracted for each age 


yroup. 
wes Exoup Ifactors: (i) Cheerfulness versus Depression; (ii) Extroversion versus 
Introversion ; (iii) Self assertion versus Self-submission. 4 
Group II factors : (i) Gheerfulnos enn Depression; (ii) Extraversion versus 
-oversion ; (iii) Adequacy versus Inadequacy. 
uw x study of normal and deprived subjects. The C.A.T. responses 
of the twenty-five deprived subjects were compared with those of a control group. 
‘The responses of the two groups were analysed and compared on the basis of the 
following variables : personality variables, inter-familial attitudes, anxieties and 
differences between the scores of the 


fears and outcome of stories. On the whole the enci en thi 
two groups on the above variables were found to be significant. The sign test results 
of the significance of the differences between the scores of the personality variables 


are given in the table below. Personality variables with a probability value of -05 


or less, should be taken as significant. 


and concurrent validity procedures were 


TABLE 2 
‘HE DIFFERENCE BETWEEN THE PERSONALITY VARIABLE SCORES OF 
NORMAL AND DEPRIVED SUBJECTS (Two-TAILED). 


SIGNIFICANCE OF 


Personality Variable High Score P 

Um General Emotionality -+--+ D :286 
2 Extroversion ..:- N -004 

3 D -210 

e N -022 

> 3 “134 

6 Acqu N -030 
us Sociability N 002 

8 Cheerfulne: N -002 

9 Depressi D -062 
10 Anxiety . D -002 
n Feeding .- N -504 
12 Ego strength . N -002 
13 Toileting | «ccn N .096 
14 Suprego Stre i wwe N -168 
15 Mother Identification N -204 
16 Father Identification D -504 

La 


N = Normal Group with hig! res. 
D= Deprived Group with high scores. 
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(iii) Changes in Personality. The C.A.T. was used to measure changes in 
personality of fifteen subjects who were considered by their teachers to have changed 
racteristics over a period of one year. Two sets of 

of teachers’ ratings—one before and one after the 
The data were analysed and 
In the majority of cases the changes 
th those shown in teachers’ ratings 


most in their personality cha’ 
C.A.T. records and two sets 
change—were available for each of the fifteen subjects. 
the changes were shown by means of profiles. 
in the C.A.T. graphs appear to correspond wi 
graphs. 

(b) Concurrent Validity. (i) 
The correlations between Teachers’ 
determined and a factor analysis of 


of four factors. On the basis of the factorial evidence it seems that the C.A.T. is 
useful in measuring such personality characteristics of 6 and 7-year-old subjects as 
cheerfulness, sociability, ego strength, asser: 

(i) Validation by the Matching Technique. Personality sketches of ten subjects 
drawn from their C.A.T. records were divi i 


judges to match with their own impressions of the 
Bes respectively, obtained an average of 50 per cent, and 


35 per cent. of correct matchings, This represents an average coefficient of 
(-- 0:624 + -160 by Burt's formulas a 


62 by B nd +0-570 + -076 by Vernon's formula.2 
(iii) The C.A.T. and the Family Altitudes Test Compared (F.A.T.). The 
responses of forty-one subjects to the C.A.T. and to Jackson's Test of Family 
Attitudes were compared to determine Whether any significant association between 
the variables of the two tests existed. The results on the whole indicate a moderate 
degree of agreement between the C.A.T. and the F.A.T, Discrepancies between the 
variables may be attributed to the differences between the two tests. 
Discussion. 
In general, this investigation has i i 
1 i stigat 85 Proved the C.A.T. to be fairly reliable and 
ber ea: di u» nr UT pelts on the whole are quite comparable to those 
estigators with project; i indi i 
study suggest the need obi dm Projective techniques. The findings of this 
inter-tester reliability. Furthe i 
: 1 : ascertain more accurately the 
relationship between the persona isti Í subjects and their repie 
to the C.A.T. An investigation into suc 
rivalry, toileting problems, etc., 
(and which were not available in t 
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Ée of English 


Grammar and Ability in 
h Composition, à 


Englis 


By NORA ROBINSON 
(Abstract of thesis submitted for the degree p 


19; 


i paa in the University of Manchester, 


AIM. 

The aim of the enquiry was t 

grammar is essential for the accurat 
ati 


to determine the degree of re] 
in composition. 


epit the proposition that a knowledge of 
Er v effective use of language and, in particular, 
P between knowledge of grammar and ability 


u 


NEU m | 
p ^ ee 
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_ The specific aims were to establish the degree of association between the two 
variables, to investigate the effect of differences in age, sex and intelligence on the 
ermine whether a knowledge of grammar enables pupils to 


relationship and to det 
write English which is accurate, logical and of a good style. 


METHODS. 


Tests of grammatical knowledge, General Ability and composition were admin- 


istered to all second year and fourth year pupils in four large, single-sex, urban 
grammar schools. Five pupils were chosen at random from each of the twenty-nine 
classes completing the tests and the resulting sample of 145 pupils was used for the 


investigation. 
was the Manchester General Ability 


The intelligence test u 
ior I which has a reliability coefficient of -95. For the test of grammatical 
ten-minute objective tests were constructed, one for each of seven 


d one for sentence analysis ; the coefficien: 


The test in composition consis 
btained from the essays : general accuracy in spelling, 
grammatical accuracy alone ; and general composition 
ability. The former scores were obtained by deducting points for errors ; the latter 
score was deri marks of three examiners marking by the method of 
general impression. Coefficients of correlation between the pupils’ three essay 
marks (general composition ability) were -569, -446 and :605 ; the intercorrelations 
of the marks of the three examiners were “891, 770 and 641. A random sample of 
ninety scripts was re-marked by the examiners and coefficients for mark-re-mark 

Lastly, the aggregate scores (general composi- 


correlations were -93, -88 and :897. i ( 
he first occasion of marking were correlated with the aggregate 


tion ability) for t £ 
scores for the second a coefficient of -94 was recorded. 


each of a different type. 
Three marks were ol 
punctuation and grammar ; 


occasion and 


RESULTS. 


COEFFICIENTS OF CORRELATION : GRAMMAR AND COMPOSITION. 


SCHOOLS 


Years 


1=general composition ability- 
2— general accuracy. 
3— grammatical accuracy: 


a—first coefficient. 

p—coefficient with intelligence held constant. 
The calculation of coefficients of partial correlation indicates that interaction of 
intelligence with the two variables causes but a slight increase in the degree of 


association. 
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In general, there is a low degree of relationship between knowledge of grammar 
and the three aspects of composition examined in the enquiry. Of the three, 
coefficients are lowest for correlation between knowledge of grammar and gram- 
matical accuracy in composition. Inspection of the scores awarded for grammatical 
accuracy reveals that these are high ; pupils make few errors of this kind. There 
is a closer degree of relationship between knowledge of grammar and general com- 
position ability in schools H and B than in schools L and N, and there is reason to 
believe that this is associated with more intensive teaching, especially of grammar. 
The effect of differences in age and sex on these conclusions appears to be slight. 

Analysis of variance techniques were applied to scores for grammar, General 
Ability and general composition ability. There were significant differences, at the 
‘1 per cent. level, between the second and fourth year groups for all three scores. 
There were no significant differences between the mean scores of boys and girls for 
any of the tests. For General Ability and composition there were no differences in 
the mean achievement of the several Schools; for the grammar scores there were 
differences significant at the 5 per cent. level between schools N and L and schools 
N and B. 

Perhaps the true test of the value of grammar lies not in the degree of its 
relationship with composition but in the nature of that relationship : is it effective ? 
In the present enquiry there is no evidence that pupils from schools where there is a 
higher degree of association between knowledge of grammar and ability in composi- 
tion, or pupils from schools with a significantly different level of attainment in 
grammar, obtain, as à group, marks in composition which are significantly different 
from those obtained by pupils from Schools with neither of these characteristics. 


NEW TESTS 


WISEMAN, S., and WRIGLEY, J. (1959). The Manchester Tests : General Ability 
Test (Senior) 2; Reading Comprehension Test (Senior) 1; Mechanical 
Arithmetic Test (Senior) 1. University of London Press. Manuals Is. 6d. 
each. Test Blanks 36s. 2d. per hundred (intelligence and reading), 54s. 3d. 
(arithmetic) ; 6d. and 9d. wach. 

‘The authors point out that there is a dearth of standardised attainment and 
intelligence tests for 14-- pupils. These ones are excellently constructed, easy to 

ive and score, and are standardised on a complete year group of over 13,000 pupils 
in Manchester schools (omitting independent, direct grant and E.S.N) Deviation 
quotients run from 70 or below at 13: 6 to 140 or above at 15 : 2. Presumably, the 
figures below 14:0 and above 15:0 are extrapolated. Split half reliabilities are 
between “95 and :97. Time limits are 60, 45 and 60 minutes respectively. 

The intelligence test is in omnibus form, with 100 items, 48 of which are of the 
conventional verbal types ; 13 are letter series, and as many as 39 involve numerical 
insight—perhaps rather an odd weighting. None are non-verbal. 

The reading comprehension passages (eleven in number) are varied and well 
chosen for modern and grammar school pupils. Boys do a little better than girls, 
and there are separate sex norms for this and Arithmetic. The suitability of the 
content of Arithmetic may be more open to question, though admittedly, itis called 
“mechanical ’ and may well be fairly typical of what is taught in modern schools 
Nearly half the questions are of the type included in Part I of an 11+ test, and there 
are scarcely any verbally stated problems such as make up Part II; instead, the 
tly fractions, decimals, percentages, and algebraic substitution. 

The authors seem to have in mind the future publication of parallel tests. 
Perhaps they could help still further by providing non-verbal intelligence, English 


usage, Mathematics, and other tests. 


remainder are mos 


Eysenck, H. J. (1959). The Maudsley Personality Inventory. University of 
London Press. Manual 2s. 6d. ; questionnaires 30s. 2d. per 100, 5d. each ; 

scoring keys 2s. 

"This is a standard type of personality inventory, yielding scores for Neuroticism 
and Extraversion. A good deal more care seems to have been taken than in many 
American tests to secure à small number of items (forty-eight in all) which yield 
reasonable internal consistency and which avoid overlap between the two traits. 
Here the intercorrelation is usually around :15, probably because Extraversion is 
interpreted (like Guilford's rhathymia) more as liveliness thanas sociability. A short 
scale of twelve items is also provided, for which fair reliability is claimed. 

Norms are based on @ * quota sample ' of 1,800 normal adults, and results are 
quoted for various groups of hospital patients. It seems à pity that percentiles are 
not given, in addition to means and standard deviations. Professor Eysenck is 
commendably cautious in stating that this is to be regarded as a research instru- 
ment, designed for experimental and other purposes. Obviously it would be most 
undesirable if personnel selectors and others without adequate psychological training 
applied it wholesale, and interpreted its results as giving complete measures of 
whatever they thought neuroticism and extraversion to mean. 


Srorr, D. H. (1958). The Bristol Social Adjustment Guides. University of 
London Press. Manual—The Social Adjustment of Children, pp. 46., 25s. 
Specimen set of blanks, scoring templates and summary charts, 9s. 6d. 
Dr. Stott has been working on his method of personality diagnosis for several 
years, and after numerous revisions, has now provided forms for the assessment of: 
(1) boys at school ; (2) girls at school; (3) a child in residential care; and (4) the 
child in the family. The Manual, though somewhat difficult to follow, is worthy ot 


close attention from child psychologists and others interested in personality. Dr. 
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iticises igidity of the ‘ objectivist,' with his fixed traits or types of 
rn sali en the sue d of the clinician All that he claims to supply is : 
Pu E of behaviour in particular contexts, grouped under a number o 
= jdi we or ‘attitudes’ (Anxiety, Hostility to Adults, Restlessness, etc.) and 
z aded for their seriousness in respect of unsettledness or maladjustment. Clearly, 
fess can be of considerable value to the psychologist in the educational clinic, the 


mpt 

to get away from the artificiality of the objective, projective or questionnaire test, 
v i "irs ' far can any observer, who 
'evertheless, a number of doubts arise. First, how 1 s 

has bs cem acquaintance with the child to answer the questions, avoid halo and 


me patterns of 
5 years. How- 


others who use 
the Guides, and we will look forward to future reports. 


Manual, is so high. 


Packets 
summary sheets ( 


' Diagnostic 


SOUTHGATE, V. (1959). Southgate Group Reading Tests, Forms A.B ana € 

University of London Press, price 5d. per test blank, Manual 2s. 6d. 

Miss Southgate has done a useful job j à group reading test 
over the5:9 to 7:9 age range, in three parallel forms. It follows the conventional 
word-recognition form: that is, each child has to ring one word, read out by the 
tester, in a group of five. The test i 


is untimed, and takes about 15-20 minutes. The 
blanks are well-printed and the Manual Clear, 


It is a pity, perhaps, that no information is 
familiarity with the tester's pronunciat; 
high reliability quoted as between diffe. 
à wide age range. However, about half t 
the correlations of around 0-90 ( 


representative of the count 
complete population, The 


; at teachers do not understand standard 
the absence of indications of Spread mean that there is 
below-average younger, or above-average older, readers. 


P. E. VERNON. 


BOOK REVIEWS 


BARANYAI, E., and LENART, E. (1959). Studies in the Psychology of Thinking 
in Written Communication. (In Hungarian.) Publication of the Hungarian 

Academy of Sciences, Budapest. p. 316. 

R This is a study of simple communication in children’s composition. Its object 
is to throw light on the psychology and development of thinking, with particular 
reference to logical thinking and forms of communication. The investigation was 
carried out on school children between the ages of 10-14 from General Schools in 
Budapest. Two different methods were used : 

(1) Various types of composition were devised for the purpose of the study, 
such as preparing a sketch from a given essay ; repeating the pattern of a given essay 
in an essay on another subject; writing an introduction and conclusion to a given 
piece of writing ; arranging the cut-up paragraphs of an essay in a logical order ; 
fitting a sentence lifted from its context into its correct place, and completing 


sentences. 

(2) Analysing children’s compositions written in the course of Hungarian 
lessons at school. Two groups of essays were analysed: (a) free compositions ; 
(b) essays written on subjects requested by the investigators. The main points 
investigated were : logical sequence, correct reasoning, and structure, i.e., relation- 
ship between parts and the whole, and the use of direct and indirect speech. 

The exhaustive and competent analysis of the material with ample quotations 
and illustrations rather bears out our knowledge on the subject than throws new light 
on it. The main factor in the development of logical thinking and abstract reasoning 
is age, and the trend of development leads from the concrete to the abstract as 


demonstrated by Piaget. j s 
The outstanding merit of the study is the contribution it makes to the teaching 
of composition and to the use of composition as a means of developing logical 
thinking. It is shown in great detail where and why children fail in satisfying the 
demands of logical thinking, but it also demonstrates the weaknesses of teaching 
methods, for example, the over-emphasis on personal experience in collecting 
material and the corresponding lack of experience in objective, detached description. 


This is an interesting and stimulating study. M. VIDOR. 


Beck, S. J., and MOLISH, H. B. (edit.) (1959). Reflexes to Intelligence: A 
Reader in Clinical Psychology. Illinois: Free Press of Glencoe, pp. xiv-- 


669, $8-50. 


This is a collection of seventy-three articles tracing the development of clinical 


psychology. The papers, which are taken from various sources, are unabridged and 
psy By pap s are given. The quality of the selection is excellent, 


over two-thousand reference Bi : 
otable omissions, and the very names of the authors indicate 


though there are some n C ip ts : i i 
the worth of the individual contributions. The major viewpoints are represented 
and the articles are separated into six sections each preceded by an original contribu- 


tisn from the editors. 

. As an introductory reader in clinical psychology it seems too lengthy and 
difficult. The numerous details of so many research articles tend to obscure thé main 
trends and the editors’ introductions do little to remedy this. Treatment is omitted 


and there is no index. 

To whom will this book be useful ? For it should be useful, containing as it does 
such a large number of important articles. Probably to those who are already 
familiar with the field, to thesis writers and to lecturers who want original sources 
and detailed reference lists. To the practising clinical or educational psychologist 


seeking a clear exposition of current trends it is of limited value. 
Maurice J. TYERMAN. 
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Buros, O. K. (edit.) (1959). The Fifth Mental Measurements Yearbook. New 

Jersey: Gryphon Press, pp.xxvii4-1292, $22-50. 

This is the eighth publication of a series which be, 
order '' to assist test users of all kinds—educators, teachers, personnel workers, 
psychiatrists, psychologists, sociologists and others—in locating and evaluating 
tests and books on testing." That is still the fundamental purpose. By emphasis on 
sound, unbiased reporting of tests and reviews, the editor claims that the year-books 
have helped test users, constructors, and reviewers to be more critical and discrim- 
inating. 

The Fifth Yearbook covers a seve 
two sections deal with tests and books. 
reviews of tests; there are 6,468 referen 
of specific tests. Altogether 485 books 
are listed together with 535 excerpts fro 


gan twenty-four years ago in 


n-year period from 1952 to 1958. The main 
In ali, 957 tests are listed together with 746 
ces on the construction, use and limitations 
on mental measurement and allied subjects 
m reviews of these books. There are classi- 
fied indexes of publishers, titles, and authors and a most useful index of tests listed 
according to subjects. More attention is given to the “ sections covering achievement 
batteries, non-projective personality tests, English, foreign languages, group intellig- 
ence tests, mathematics, miscellaneous areas, reading, science and social studies." 
The general plan follows the pattern of previous yearbooks and cross references 


are easily traced in this and earlier editions. Brief information is supplied, where 
available, on essential test facts : 


E age range, method of scoring, time of test and sub- 
tests, cost and when possible reliability and validity figures; too often these last 
two important facts are not supplied by the test constructor, but critical and perti- 
nent reviews often reveal why this information was withheld. 

T his edition gives comprehensive and up-to-date information on test material 
published in English-speaking countries, but it is sad to note that because of increased 
costs, and the vast growth of material on mental measurement, this may well be the 


last of these valuable reference volumes, It is proposed that the Sixth Yearbook 
should cover work in the United States. 


f i A smaller book, Tests in Print, published 
esten i ape n] would contain a comprehensive bibliography of titles 

shed in English-speaking countries, If this i ‘ PE TE 3 
prehensive and near-international coveraj us 1 thei laet Buros vot Yaroo 


“Seti e Ec on mental measurement, it is most 
fitting that it should have been dedicated to Sir Cyril Burt. = NW. H. Kine. 


CLARK, M. M. (1959). Teaching Left-Handed Children. Scottish Council for 


Research in Education, Publicati M 7. Universi 
ende A ar cation No. XLIV. University of London 


This pamphlet aims to make avail; a a sasi stateme 
of the findings of De Cie tn ilable to teachers and others a simple statement 


th ai d 
ed NIV! be om ‘attempts to te with the right-hand, or through mere 
ass d io Vioc zene is tolerated ; and there must also be many more who have 
s n dist ms » but who might have written with greater facility and less 
emotionat disturbance, had skilled instruction and understanding been available. 


CRONBACH, L. J. (1960), Essentials of Psy i ; T" 
York: Harper, PP.xxi-+ 650, ay Sychological Testing (2nd ed.). New 
Many connoisseurs of menta] testing considered the first (1949) edition of 

Cronbachs Boolerto be the most stimulating and informative of any in this field. 

The new edition is bigger and better, though perhaps a little less s edi in its 

treatment. — It is very up-to-date, Over two-thirds of the refere y: amig Ae 

published since the previous edition ; and British Work receive. nces E bees 
attention than in most American texts, However, its very Wed rp mese ee 


1 
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ideas makes it rather less suitable as a textbook than some of its more pedantic 
competitors. Rather it should be regarded as advanced reading for the sophisticated 
worker in guidance and selection. At the same time it is extremely readable ; for 
example, the simple interpretations of factor analysis and decision theory are real 
tours de force. 

The main sections are : Purposes and types of tests ; Administrative proced- 
ures ; Types of scoring ; Validity and reliability ; Individual tests of general ability ; 
Group tests ; Classification of abilities, differential testing, tests of special abilities ; 
Personnel selection ; Educational attainments ; Tests of interests ; Personality 


nventories, ratings, behavioral tests and projective techniques ; Clinical approaches. 
P.EV. 


DE London : McGraw-Hill, 


E, JAMES (1958). The Psychology of Learning. 


pp-vii+367, 50s. 6d. 
y intended as 2 text book for psychology students and seems to 


e of the better things of its kind. 1t is closely packed with useful 
i e and clear arrangement make it easy to read. ]t is 
divided conventionally into chapters on diflerent aspects of learning, but more care 
than usual has bee! ubject matter of different chapters, and 
the student should fin 
‘The book as à whole aims to give an outline of the present S 
psychology. The author points out that the period of comprehensive explanatory 
theories and controversial warfare between rival ‘ schools ’ is now largely over, at 
least in the experimental branches of psychology. He is not, himself, attached to 
any one broad theory. The student is, rather, invited to consider more limited 
theoretical questions ; r instance, the nature of reinforcement ; the function of 
verbal mediation in prob > f the evidence 
be taken as established, what may 


is summarised and the au i ishec a 
be provisionally acepted on the balance of the evidence, and what is still uncertain, 


The main emphasis of the book is on conditioning, but insight, hypotheses and 
problem solving are also discussed. There are also useful chapters on motivation and 
emotion, in connection with learning. A final chapter on Theory and Applications 
of Learning,” contains 2 rather curious assortment of topics. Association versus 
Gestalt theory, neurophysiology, imprinting, psychotherapy, socialisation and 
education are tree id succession, and rather too briefly to be very useful. 
But the book, as it wi ld be a considerable help to students and of interest 
to many others who cialists in learntng psychology, but wish to keep 
themselves informed ab S. C. RICHARDSON. 


tate of learning 


edit) New York: Rinehart, 


958). Statistical Analysis (Rev. 
Ü ) Text and Workbook, 


d ind, $4.00. Workbook, pp- 76, $125; 


This is a sound, conser 


EDWARDS, 


vative, statistical text which should provide the student 
with a thorough understanding of basic principles. It does not attempt to 80 
beyond the standard topics (for example, elementary analysis of variance only is 
included, and factor analysis is excluded), but it makes the why and wherefore of 
statistical techniques unusu 
applications, its exampl 
It is accompanied bya 
chapter which are not, b 
to work to answer them. 
ies and training colleges are too variegated fo 


issuing similar Workbooks. But they could be a valuable teach’ 
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Epwarps, A. (1957). The Social Desirability Variable in Personality Assess- 
ment and Research. New York: Holt, pp. xv4-108, $3-25. 


it is both, in much the same Way as halo in ratings partly represents à genuine 
tendency for ' good ' character traits to inter-correlate, partly a tendency to rate 
people we like high on all desirable traits. Nevertheless, Edward's systematisation 


of this topic will be of value to all who work with inventories, ratings, Q-sorts and 
the like. Moreover, as his oO 


uke. IS own Personal Preference Schedule shows, it opens up 
possibilities of forced-choice tests or ratings which are less susceptible to these 
distortions, 


Fraser, E. (1959). Home Environment and the School. Scottish Council for 
emg Education, Vol. XLIII, University of London Press, pp. vii. 
, 10s. 6d. 


The Scottish Research Council has performed à useful service to educational 
psychologists in publishing Miss Fraser's investigation, which was carried out Some 
10 years ago. She performed the ast 
a parent or parents of each of 408 A 
environment relevant to School achievemen 
Parents' own educatio 


occupation, family Size, rooms Per person 
home and general home atmos 1 
Moray House L.O., and slightl on i ttai er at 
work did not. In combination these fact rupe present m RUE 
+750, respectively, the most efficient pre 
home. 
Miss Fraser provides a Sensible di i i i i 

eaucstion, and M ook: discussion of the relations of intelligence and 


A I L onment affects A B , 
educational implications of her Andines het, jand sake rings aut the pran 


FRIEDENHAIN, P. (1 lvi N 
edie ou ex 193 Me and Reveal : Interpretation of Handwriting. 


illustrations. It includes an i 
umber 

where the Braphologica] diagnosis z T 

One's doubts are, if an: i 


&raphological analyses on their 
P.E.V. 
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GRoxLuND, N. E. (1959). Sociometry in the Classroom. New York: 
and Brothers, pp. xviii--340, $4-50. Meo wie 
Since Moreno published Who Shall Survive in 1934, there has been a growing 
volume of sociometric research. Most of this has been reported only in specialised 
journals and the general reader has had to be content with passing references in 
books of a more general nature. Sociometry in the Classroom is to be welcomed as 
remedying this situation. The book contains an account of sociometric techniques 
with directions for giving and recording the results of sociometric tests, a discussion 
of the meaning of the results, and suggestions for applying them in the solution 


of educational problems. 

Too often sociometric writers have been more noted for their enthusiasm than 
for their powers of criticism. The present author is to be commended for showing 
both the importance of the technique and its limitations ; for it must be admitted 
that the very ease with which a sociometric test can be applied has induced what can 
only be described as ‘ test happiness ' in many devotees. This sane and judicious 
account of the ways in which the results of a sociometric test can add to our 
knowledge of a group of children and give guidance for their future treatment should 
do much to ensure that those who adopt the methods described do so with a full 


appreciation of what the test does and does not tell us. 
ewness of the sociometric diagrams shown. 


Tf the book has a weakness, it is the fi 
A large part of the appeal of sociometry has always been the clarity with which a 
sociogram portrays the relationships within a group of people. A book which sets 
out to describe the basic techniques of sociometry should surely base more of its 
exposition on a study of sociograms than is done here. This, however, is a minor 
point, and the book can be strongly recommended as satisfying a real need. Itshould 
find a place in all training college libraries. K. M. EVANS. 

sr R. (1958). Theories of Learning. (Rev. ed.) London : 
563, 45s. 
{Hilgard’s Theories of Learning contains most of the earlier 
] chapters, those on Wheeler and Maier, are omitted ; 
tirely in the form of additions. Chapters on some earlier 
writers, such as Guthrie and Skinner, have been brought up to date by reference to 
recent work and the one on Hull has been considerably extended. The author's 
evaluation of the theories discussed is not greatly changed. The new parts of the 
book are a chapter on Freud ; another on mathematical models ; and two chapters 
on 'current developments ^' These last chapters are concerned mainly with 
theories of reinforcement and discrimination learning, and with neurophysiological 


models. 
In surveying the presen' 


HircARD, ERNE 
Methuen, pp. vii-F 
The revised edition o 
book. Two of the origina 
other changes are almost en 


t state of learning theory, the author maintains his own 
eclectic position. He suggests that differences between learning theories are still 
partly due to the fact that experimenters tend to concentrate on one type of learning 

rlier view that we should be prepared to find different 


experiment. He maintains his € 
t different evolutionary levels, and that some types of 


types of learning process a 

human learning may be qualitatively different from learning in animals. 

Non-American readers may feel that there is rather little mention of work done 

outside the United States. Even Pavlov receives only two passing references. But 

the book remains the standard account of the search for a satisfactory theory of 
ll be welcomed by most people interested in the 

t and future development. 


learning, and the revised edition wi 
history of learning theory and its presen 
S. C. RICHARDSON. 


HUGHES, A. G., and E. H. (1959). Learning and Teaching. (3rd ed.) London : 

Longmans, Green, pP- x+484, 14s. 

‘The third edition of this ' classic’ text-book is sub-titled, as were the previous 
editions, ‘ AD Introduction to Psychology and Education’. It differs from them in 
having been, in part, re-written in an endeavour to take account of some of the 
recent changes in education and some of the recent developments 1n educational 
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psychology. The authors have also undertaken a partial revision of the bibliog- 
raphies, which follow most chapters and they have added numerous, and sometimes 
lengthy footnotes. Their convictions remain substantially the same as they were in 
earlier editions as does their sincere concern for education. 

It is a formidable task for any 
text-book two decades of education 
not entirely succeeded is no reflect 
the magnitude of the task. 


pair of authors to undertake to integrate into a 
al research and change. That the authors have 
ion on their efforts. It is rather a reflection on 


P. H. TAYLOR. 


Husey, T., and HENRYSSON, S. (1959). 
Comprehensive School. 
price quoted. 


Since 1950, Sweden has be 
16 years in common schools. This is a re 


Differentiation and Guidance in the 
Stockholm: Almquist and Wiksell, pp. 196, no 


countries, and the accounts of their own 
E th sla are of interest to the British reader, particularly 
Since the participants concluded that gradual differentiation and guidance from 


ees: 2 , Tec selection at 11--. Another specially noteworthy 
Pour is that the Swedish Ministry is co-operating with the Stockholm Institute of 
ucation in scientific research on the development of interests and abilities and the 
€ e r ite over this period. Itis difficult to imagine our own 
administrators basing their policies in the same way on the results of research. 


MonTENSEN, D. G., and SCHMULLER, A Sud, TES TS 
. Fork = wan EER, A. M. (1959). Guidance in Today's 
gom New York : Wiley ; London : Chapman and Hall, pp. viii 4-436, 


m | ehe ien ieri abuse s techniques, the twenty-one specimen 
aspects of this manual. The s t ne illustrative case-histories are useful practical 
British, two mis-spelt "The a th mg 's American : of 380 references only five are 
“background maternis se u OE maintain that a book on guidance must include 
Consequently, the first half ae 9 the philosophical and scientific bases of guidance.” 
theoretical basis Almost = Bes book discusses the meaning of guidance and its 
mention, four pages to F. Eie relevant aspect of psychology receives passing 
This broad shallow itii Re, d = behaviourism, three to sociometry, and so on. 
indeed reduce the appropriate: 5x $ € psychological background to guidance may, 
separate specialism ard ae — of the book to Britain, where guidance is not a 
have learned this b; s er OF psychologist concerned with guidance will (or 

es acksround from wider and more systematic study. The 

the roles of counsellor and teacher, will not 

ity of counsellors’ educational and psycho- 
BeA Seo the text provides valuable practical 
. recognize the importance of this aspect of 


nselling 


logical knowledge. However 
Spans for those who are ee 
education, 
Joun NissBer. 

NATIONAL ASSOCIATION p 

Phobia? Proceedings of the 15 Hifi (1959). Truancy—Or Schoo j 

National Association fo js Inter-Clinic Conference. London : 

In Britain, until a few year: * Pp. 40, 3s. 6d. 

im 


OR MENTAL 


" E ss S ago, a 
offence: it is now seen as an eser us fr 
tant co 


facts and gives suggestions as 
€y and school phobia shade 
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into each other and that children showing these symptoms tend to be lonely, insecure 
and unhappy. Itis also made clear that in nearly every case the main trouble is not 
in the school but in the home, and that treatment must be based on meeting the 


child's individual needs. 
In general the reports are untechnical, lucid and enlivened by first-hand descrip- 
MAURICE J. TYERMAN. 


tions. This isa pamphlet worth reading and keeping. 


Orsox, W. C. (1959). Child Development. Boston : Heath, pp. xi--497. No 


price given. 
This is the second edition of a book first published in 1949. Init, the Dean of the 
School of Education at the University of Michigan gives to teachers and intending 
teachers a broad, well-documented and reasonably balanced review of earlier and 
mental findings in the field of child development. His book covers 
Mental and Emotional Development ; the Growth of Educa- 
tional Achievement ; the Child as à Whole; Human Relations in the classroom ; 
the Child in the Home and Community ; the Affective Life of the Child ; and the 
‘Teacher in Individual and Group Relationships. 
y say that the author's fundamental belief is that the child grows 
as a whole. Indeed, arising out of the work of S. A. Courtis, B. O. Hughes and Olson 
concept of Organismic Age, which is derived from 
both physical and psychological measurements, and the author makes great use of 
i k. Itis his clear view that learning is a function of total 
Only future research can tell us exactly how valuable this concept of 
It can be said, however, that he has drawn our attention 
in children that we sometimes overlook. For example, it is 
now manifest that the effects of remedial education with many children are not as 
satisfactory as Was hoped ten years ago. Perhaps we have placed too much stress 
on the backward child's emotional life, sociological conditions, and teaching methods 
per se, and overlooked genetic and constitutional influences that cause children to 
mature at very different rates. In a way, Olson's concept is not altogether new ; 
Burt used somewhat similar concepts in Chapter 6 of The Backward Child. 
British readers may be disappointed to find so few references to relevant work 
done in this country. They will also be surprised to find that the work of Piaget is 
‘There are a few inaccuracies ; the information (p. 45) 


dismissed in eighteen lines. : ) / 
that there are forty-eight chromosomes 1n most body cells and not forty-six, might 


be explained by new evidence coming to light between the time of the book going to 
press and publication ; ut British Courts have not been able to award corporal 
punishment to delinquents since 1948. However, the book is very readable and to 
be recommended. K. LOVELL. 


up-to-date experi 
the topics of Physical, 


growth. : 
Organismic Age will be. 
to an aspect of learning 


Pracer, J. (1960). Language and Thought of the Child. (3rd English ed.) 

London: Routledge and Kegan Paul, pp. xxiv-+286, 25s. 

PIAGET, J., and INHELDER, B. (1959). La genàse des structures. logiques 
élémentaires. Neuchatel : Delachaux et Niestlé, pp. 295, fr.s.13. 
Piaget's book on language, first published in 1923, reappears in its third English 

ted to a comparison of egocentrism in the language 


edition, with a new chapter devo p 
of young children when talking to an adult with that obtained in conversation with 


their peers. Piaget finds that about the age of three, egocentrism is greater in the 
company of an adult than in that of other children ; by the age of four the propor- 
tions tend to be equal. Socialised remarks addressed to an adult show an over- 
whelming predominance of questions, a feature which is not apparent in socialised 
remarks occurring between children. Piaget sees support in this finding (based on the 
conversations of four children) for his contention that the decline of egocentrism owes 
much to the conflict of view-points and opinions occurring in the peer society —where 
the child does not adopt a frankly subordinate role. Replying 
is at pains to point out that the fundamental fact is intellectua. 
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La génese des structures logiques élémentaires is a study of development in classi- 
fication and seriation. The former especially, the discussion of which takes up eight 
out of the ten chapters in the book, would seem to be of particular significance to the 
development of thinking as a whole. An elementary classification may be recognised 
in the adaptive behaviour of quite young children in relation to objects, and later in 


blue because he can see the red round beads, bu 


all the blue beads are round (because some round beads are red !. He can only 


compare the two classes if they are Co-extensive. The difficulty, according to Piaget, 
is that of co-ordinating the extensive i i i 


argument by analogy, A noteworthy f 
full quantitative tables to illustrate the find 


Medea ue pas gain, where earlier Piaget stressed the lack of social 
one-sidedness of perce: tio i poo te logica] “inking, wget now pee pee Es 
en is indispene eine 5 ^s he latter approach often leads to an over emphasis 
formal level." That X a is is ju? POrt to operational thinking below the 
P s Such an emphasis is incorrect is clear from the coincidence of the 


m and gear our methods to the understanding of the 
E. A. LUNZER. 


psychology, but also the many 
pegs of psychology ex us they achers. The authors present with subtlety the 


S Of general an, a ae Popular clothes borrowed from divergent 
~ i : : For instance, the earlier 
analysis (without SY. E J i 

eory of Personality, i"8 it such) in an attempt to give 
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Part A, amplifies the introduction, Part B, appositely headed “ Some Funda- 
mentals, " deals in five chapters with such topics as Motivation, Behaviour Mech- 
anisms and Growth, etc. In Part C, the meat, though thin, begins to appear at last, 
and the scene of general psychology is switched to that of social psychology, without 
calling the reader's attention to it. It is headed "Classroom Applications," and sub- 
divided into seven popular eye-catching titles such as: “ Unusual Learners,” “ Group 
Life in the Classroom," etc. Here one encounters, in disguise, such well weathered 
friends as Moreno and his sociogram, and Lewin, Lippitt and White, and their classical 
social climate experiment. Newer and more personal ground is broken in later 
chapters which deal with the various roles that the teacher has to play in relation to 
his pupils, where he has to act alternately as ‘ trouble-shooter,’ as parent substitute 
or as target for hostile feelings. He is also given advice as to how to deal ‘ psycho- 
logically ' with complex classroom situations, such as arise when a boy has stolen 
another’s money, etc. The chapter called “ Influence Techniques " deals with 
problems affecting discipline and leadership at school, where the teacher is advised 
not to enforce discipline crudely, but to devise methods which support children’s 
own self-control via a display of ' humor,’ by closing in rapidly ona potential mischief 
maker, or by removing “ seductive objects " (inkwells are not mentioned). Inci- 


dentally, the time-honoured traditional English Public School stick finds no favour 
Teachers are advised to give naughty children a 


in this American textbook. 
‘chance of cooling-off ' in corners, and are warned of the dangers of instituting 
' mass-punishment ' in order to detect a single culprit. The last chapter advises 


teachers who are in conflict to stick to the rule of the golden mean in most circum- 

stances affecting discipline or their own anger when aroused. The authors thus 

show that a solution in conflict situations is possible (despite their gaffe in the chapter 
heading, which includes in this connection, the word ' dilemma Dr 

The last section of this book, called ' 'Special Problems," is in some ways the 

the thread of ' general psychology ' changes to that of 


most interesting one, for here the c c ł 
* psychopathology.’ Interest derives less from the factual discussion and advice on 


the detection and management of neurotics, delinquents, geniuses and dullards, than 
from a spot-light on the psychological make-up of the teacher himself. Here are 
real glimpses of depth-psychology- Readers are made aware of the necessity for 
teachers to face up to some form of an inner self-analysis, especially in regard to their 
vocation. May one have taken up teaching to satisfy such needs as: status gain ; 
love for a subject ; identification with one's own former teacher ; love for children, 
or because of a reaction against one's own unsatisfactory childhood in an endeavour 
to create an easier world for the new generation ; the desire to re-live childhood 
patterns, or because ofa pre-conscious craving for affection or power ? A 
In recognising some of these variables, it is indirectly suggested that a cathartic 
effect takes place within the teacher, which in turn, must have a stabilising influence 
ona class. Teachers are also advised to learn that experience will ultimately help 
them to cope with complex problems, and that in personally stressful situations they 
may be aided by taking uP gue—or even a psy- 


a hobby, or by consulting a collea 
chiatrist ! § WS 
In a final chapter, the authors refer wisely and modestly to the “ Limitations of 
Mental Hygiene in Education," and thereby, as experts, let the cat out of the bag 
on-expert readers in the field of psychology. They 


in front of their essentially ™ É n : 
perform this task of deflation gently and with agility, covering themselves impres- 
sively by supplying a list of current references (including audio-visual ones) at the 


end of each chapter, and by attaching an appendix on sources of the literature and 
agencies in this field, and {he meaning of special terms, all for the unhibited use of 
budding teachers and psychological greenhorns. The psychological soup of this 
book has been well watered down in its approach to provide a primer in a series 
which could well be called “ Mental Hygiene without Tears." It is, nevertheless 
a useful textbook, especially for newcomers and potential converts. 

R. G. ANDRY. 
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TERMAN, L. M., and ODEN, M. H. (1959). Genetic Studies of Genius, Vol. V— 
The Gifted Group at Mid-Life. Stanford University Press, pp. xv-+-187, 
36s. 


This volume carries the follow-up of Terman’s gifted 
years after the initial testing, 
collected and analysed before T 


group to nearly thirty-five 
the mean age being now 44. The information was 

erman’s death, but most of the writing-up is the work 
of his co-author. Nearly 1,300 subjects, representing 93-95 per cent. of the original 
group are covered. 


The present volume is perhaps the least interesting of the five, since it adds 
little to the findings of Vol. IV. One gets the impression, also, that the interpreta- 
tion of the data is less skilled and lively than it might have been in Terman’s own 
hands. For example, the results on present intelligence consist of the scores of the 
gifted group, their spouses and other high-grade groups on a new form of the 
Concept-Mastery test, together with Terman-Merrill LOs. of 1,525 of the gifted 
parents’ children. There is no Way of interpreting the former in terms of general 


population norms (except through the fact that the gifted group have significantly 
improved on their scores of 


: 10 years ago) ; and the latter is almost equally meaning- 
less—for while the average child 1.Q. is 132-7 (one third scoring 140+ and only 
2$ per cent. below 100 e to relate these figures to the Standard 
Deviations of L.Qs. at the ages of testing. 


€ n ration in the past ten years is the rise in occupa- 
tional status, 86-3 per cent. instead of 71-1 per cent. now falling in the top two 
occupational classes, and only 1-2 per cent. as against 6-9 per cent. in the semi- 
skilled class or below. As a whole, also, the group has become noticeably more 
conservative in its attitudes, ` 

Genetic Studies of Genius rem 
favourable prospects of a child w; 
work. Itis good to know that thi. 
on the later careers of the subje 


lains the most impressive demonstration of the 
ith high LO., and a monument to Terman's life 
S work will be carried on and that further volumes 
cts are intended. PEV. 
TRIST, E. L., and Sorrr, C (1959). £ TU at ; , : 
EC eie SES Ni - Explorations in Gr ve s. Leicester 
University Press, p. 68, 7s. 6d. plorations in Group Relations 
in E. first full-scale exper: itain with the laboratory method of training 
ihe Tavistock T es hé 97, by the University of Leicester and 
EL. —Ó itute ions, and was attended by forty-five 
` aut ga ial and non-industrial occupations. 
Through participation in study od epplication groups, the members were led toan 
eines E Psychological forces which operate in such groups. 
Th pi e in such group: 
tailed ot Outlines the meti S of the conference, but gives little 
» a useful bibliography which will guide 
ee her. The method might be of value in the 
inderstood its j re 9 be used with caution and only by those who 
which might be — g DNE ESPHDIR GE dealing wit Sa 
. M. EVANS. 


f PUBLICATIONS RECEIVED 
The mention of 


E A a book in this list neither implies, nor precludes, a later 
- W. TuiBAUT c : = 
J New York. pos KELLey (1959). The Social Psychology of Groups. 

P. Haros (edit.). Papers ¢, ; f ondon : Chapman and Hall, pp. xiii 4-313, 56s. 
Review Monograph No 2 he Kitching of Personality Development. | Sociological 
pp. 149, 12s. (7 Keele: University College of North Staffordshire, 
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A. L. Epwarps (1957). The Social Desirability Variable in Personality Assessment 
and Research, New York: Dryden Press, pp. xiv-- 108, $3.25. 

A. L. Epwanps (1959). Statistical Analysis (rev. edit). New York: Rinehart, 
pp- xii +234, $4.00. Accompanying Workbook, pp- 76, $1.25; Textbook and 
Workbook, $5.00. 

L. Postman and D. A. RILEY (1959). Degree of Learning and Interserial Interference 
in Retention. Univ. California Publ. Psychol., Vol.8, No. 4, pP- 271-396, $2.50. 

M. Reap (1959). Children of their Fathers : Growing wp among the Ngomi of Nyasaland 
London: Methuen, pp. 176, 18s., 

H. H. Somers (1959). Analyse Mathématique du Langage : Lois Générales et Mesures 
Statistiques. Louvain: Nauwelaerts, PP- 40, FB.80. 

P. FRIEDENHAIN (1959). Write and Reveal : Interpretation of Handwriting. London 
Owen, pP- 184, 30s. 

M. M. Crank (1959). Teaching Left-handed Children. Scottish Council for Research 
in Education, Publ. No. XLIV. University of London Press, pP- 44, 2s. 6d. 

1. R. WECHSLER and J. REISEL (1959). Inside a Sensitivity Training Group. Univ- 
ersity of California, Los Angeles : Institute of Industrial Relations, Monogr. 


Ser. No. 4, PP- 133, $2.00. 
R. S. Peters (1959). Authority, Responsibility and Education. London: Allen 


and Unwin, pp. 137, 12s. 6d. . 
P. DIENES (1959). Concept Formation and Personality. Leicester University 


Press, pP- xi+82, 18s. . 

GALANTER (edit.) (1959). Automatic Teaching. New York: John Wiley ; 
London: Chapman and Hall, pp. vili+ 198, 26s. 

A. V. JUDGES (edit.) (1959). The Function of Teaching. London: Faber and 


Faber, pp. 172, 15s. E, " . 

J. PIAGET (1960). Language and Thought of the Child (3rd edit.). London : Rout- 
ledge and Kegan Paul, pp. xxiv 286, 25s. . 

F. V. Sura. (1960). Explanation of Human Behaviour (2nd edit.). Constable, 

. xi-- 460, 30s. m 

V. SouTHGATE (1959). Southgate Group Reading Tests, Test 1 (Forms A, B, C) Word 
Selection. University of London Press, 5d. each test blank, Manual 2s. 6d. 

L. TussiwG (1959). Psychology for Better Living. New York: john Wiley ; 
London: Chapman and Hall, pp- xii-- 496, 48s. 

T. Husen and S. HENRYSSON (1958). Differentiation and Guidance in the Compre- 
hensive School (Report on the Sigtuna Course, 1958). Stockholm : Almquist 


and Wiksell, pp. 196, no price given. 

WorLD FEDERATION FOR MENTAL HEALTH (1959). Africa : Social Change and 

Mental Health. London: W. F. M. H., pp. 40, 3s. 6d. 

D. H. FUNKENSTEIN (ed.) (1959). The Student and Mental Health: An Inter- 
national View (Proceedings of Princeton Conference, 1956). Cambridge, Mass. : 
Riverside Press, PP- xxv +495, 35s. x . . 

F. J. McDONALD (1959). Educational Psychology. San Francisco, Calif. : Wads- 
worth Publ. Co., PP- xix+748, price not given. . 

R. Knicut and M. KNIGHT (1960). 4 Modern Introduction to Psychology (6th edit. 
rev.) University ‘Tutorial Press, pP- 272, 9s. 

B. N. Puurs, R. L. DUKE and M. V. DE VAULT (1959). Psychology at Work in 
the Elementary School Classroom. New York : Harper, pp. xii-- 395, $5.00. 

S. L. Pressey, F. P. ROBINSON and J. E. Horrocks (1959). Psychology in 


Education. New York: Harper, pp. xi+658, $6.50. 
L. J. Cronpacn (1959). Essentials of Psychological Testing (2nd edit). New York : 


Harper, pp. xxi--650, $5.00. i 
G. Murpuy (1958) Human Potentialities. New York: Basic Books. London: 


Allen and Unwin )1960), pP: x-- 349, 255. 
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Teaching in Comprehensive Schools. Cambridge University Press, pP- 48, 2s. 6d. 


M.L. J- ABERCROMBIE (1960). The Anatomy of Judgment. London: Hutchinson, 
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K. R. Hammond and F. Kern (1960). Teaching Comprehensive Medical Care; A 
Psychological Study of a Change in Medical Education. Boston, Mass. : Harvard 
University Press, pp. xxii+642, 89s., 

E. GLOVER (1960). The Roots of Crime : Selected Pa 
London: Imago, pp. xiii+422, 45s. 

J. G. CRowTHER (1960). Francis Bacon : 
Cresset Press, pp. xv 4-362, 35s. 

B. Croruers and R. S. Paine (1959). The Natural History of Cerebral Palsy. 
Cambridge, Mass.: Harvard University Press; London: Oxford University 
Press, pp. xi4-299, 32s. 6d. 

R. G. Anpry (1960). 
pp. xv 4-173, 21s. 


pers on Psycho-Analysis, Vol. 11. 


The First Statesman of Science. London : 


Delinquency and Parental Pathology. London: Methuen, 


THE XIV INTERNATIONAL CONGRESS OF APPLIED 
| COPENHAGEN, AUGUST 13-19, 1961 


_ The XIV International Congress of Applied Psychology will be held 
in Copenhagen at Christiansborg Castle, the Danish Parliament Building, 
August 13-19, 1961. Application forms and detailed information will be 
mailed during fall 1960 directly to registered psychologists in all countries. 
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PSYCHOLOGY, 


VoLUME XXX. NOVEMBER, 1960 Part 3 


LEVELS OF ATTAINMENT OF EDUCABLE SUBNORMAL 
ADOLESCENTS 


By R. E. MORAN 


(Assistant Professor of Psychology Faculty of Education, University of 
Puerto Rico). 


Summary. This study is an attempt to evaluate, by means of specially 
d tests, the academic achievement of a representative 


sample of 300 educationally subnormal adolescents in the 'top' classes of 
Their results are compared with those of 


‘normal’ children with the same chronological ages in a secondary modern 


school and with those of children of similar mental ages in a junior school. 
of the E.S.N. adolescents show a wide range of vari- 


1 below those of the junior school in every area tested, 


emphasis on an 
a relatively superior achiev 


others in the curriculum. 
and emphasis prevailed in all E.S.N. schools, and were directed at all subjects, 


combined with better teaching techniques and sound organization, an over-all 
rise in the general level of attainment could possibly be accomplished. 
However, the writer suggests that the curriculum, although incorporating 
the academic and craft subjects, should be directed primaril 
jal maturity, rather than depending main! 


guidance and soci: y 
in which research has consistently shown the subnorm: 


handicapped. 


J.— INTRODUCTION. 
PsvcHoLocisTS and educators have long been concerned with the standards of 
achievement to be expected of educationally subnormal children in view of their 
low mental capacities. Past studies in this field have yielded discrepant results. 
Thus, Merrill (1922), Wilson, F. T. (1926), and Lewis (1944) have shown that 
the academic achievement o subnormal children approximates and sometimes 
More recently, Bensberg (1953) has found that this 


exceeds their mental ages. | ) 1 
population achieves essentially the same as the normative population for com- 


Renshaw in which 
lower than their gener 
Sleight (1932) give simi 
who are retarded mentally appe : Abr dads 
Gordon (1923), when studying subnormal children residing in the same district 


as many of those in Burt's researcn, 
ment equates its mental ratio. Indeed, he found that his ' B’ group (girls) 


had a spelling ratio (76:7) higher than its mental ratio (74-9). 

Qualitatively, Wallin contends that no real moron (feeble-minded) can do 
third grade work (8-9 year level). Parker, Nemzek, Engel and others claim that 
subnormals finish school with attainments equivalent to fourth grade level 

9-10 years) (Quoted from Merrill, 1922), while Burt (of. cit.), in England, has 
shown that educable defectives are usually capable of accomplishing work done 
in Standard II. (This would be comparable more or less to third grade level in 
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America, as children in England usually commence their formal studies a year 
earlier.) : : 

However, regardless of whether the academic achievement of the subnormal 
child exceeds, equates or falls below his mental age, the procedures or instru- 
ments used in determining this have been constantly criticized by later research 


Conversely, when subnormals' achievements are below their mental ages, 
it is claimed by some that these children are penalized on attainment tests 
standardized on a nation-wide school population. The wide range in standards 


upon mixed groups at different LO. levels are misleading. Another valid 
criticism is that many investigators us 


the slow, apathetic subnormal child. 


to their mental capacities. The approach. 
S of past studies. Only salient observa- 
for fuller details of interpretation and 
t is hoped that the results of this study 
and teachers a more lucid picture of the 


pias ained by children ascertained as educabl 
Subnormal and receiving education in speci y a a ably 


used attempts to obviate the criticism 
tions and results are reported here ; 


IL—DzscniPrioy OF SAMPLE, 


The vast majority of the children test ri IN 
, sted were from the London area. The 
"PW eme grote included 300 subnormal adolescents, ages 14-164- (mean 
‘cht i sw months), 160 boys and 140 girls, consisting of the ' top ' classes 


of eight E.S,N. boys' seni i ; : ? - 
girls senior e Per pa (including one residential) and seven E.S.N. 


2 S One residential). Thus this experimental group is 
Ti 
a Gar okey aT about 12 per cent. of the total 2,180 London senior 
41-90, mean 65.7 (bo - Le. duse LOs. on the Terman-Merrill range from 
Matrices* from 55.99 mean 68-5 and girls’ mean 62-9) and on the Progressive 
Terman-Merrill mental z e nds (boys' 77-7 and girls’ 69-8). The mean 
girls 9:0); mean s age of the experimental group was 9: 5 (boys 9: 10 and 
10:0). ^' OBressive Matrices mental age 10: 7 (boys 11: 3 and girls 


m one junior and one secondary modern school. 
- 9:10, mean P.M, I.Q. 105, mean P.M.M.A. 
among 458 pupils of a three-streamed 
Portion of boys to girls as in the experi- 
for expressing p, — $ 

repared by the Maudsley Hospi 45, ~ rogressive Matrices scores as M.As. and I.Qs. 
Tos. 2 3 ospital Institute of Psychiatry, was adopted (Gwynne Jones. 
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mental group. This junior group offers a sampl i 
children with similar mental ed to those in the cen tetro SEIN 
To determine a level of academic achievement for adolescents of the same 
chronological age as those in the experimental group, approximately fifty boys 
and girls (mean C.A. 14: 4, mean P.M.M.A. 12:3, mean P.M. LQ. 88) road 
selected from a three streamed secondary modern school. Both the junior and 
the secondary modern schools draw their enrollment from catchment areas 
similar to those in which reside the majority of the E.S.N. children, viz. : 
working class and lower middle class residential districts. i U 


IIL—TESTS ADMINISTERED. 


All the tests used in the present study, with the exception of those of reading 
and intelligence, were specially adapted or constructed by the writer, and were 
administered as un-timed group tests in the following order : 

Mental Arithmetic: An adaptation of Burt's General Oral. Test VIII, in 
which a large proportion of the problems involved money transactions and were 
presented in meaningful life-like contexts. 

Craft (Boys and Girls): The boys’ and girls’ craft tests are completely 
different in content, the former measuring, among other things, a specific 
knowledge of tools and their usage, while the latter is predominantly a test of 
general household knowledge and ‘common sense.’ Both tests are presented 
pictorially and were especially constructed by the writer for this study. 

Language : This test, also specially constructed by the writer, is an attempt 
to measure the most elementary language usage and common grammatical 
errors committed by subnormal children. 

Spelling: An adaptation of Cattell's “ Midland Tests 3 ” was made by the 
writer so as to include words least affected by his non-English accent. 

Reading : The Vernon-Warden Group Reading Test A was used, which is 
composed of two parts: word recognition and comprehension. 

General Information : This test, constructed by the writer, is an attempt to 
obtain an indication of the subnormal’s general knowledge, and contains sixty 
questions which cover the following areas: elementary Geography, History 
and General Science, Religion, Everyday Observation, Sports, Current Events 
Laws and Regulations, Civics and Government, Health and Hygiene. 

Intelligence: The Progressive Matrices (1938, Revised Order 1956) was 
administered by the writer as an un-timed group test. Terman-Merrill: the 
most recent I.Qs. of the experimental group on this test were also used as critiera 
for comparing attainment test scores. . . 

Further included was an exercise in which the examinee was asked to write 
correctly and legibly his name, address, date of birth, the name of this school, 
and expected vocation. The writer also prepared a questionnaire which was 
used as a guide for eliciting Heads' and teachers' opinions and attitudes on the 


present E.S.N. curriculum, organization, etc.* 


IV.—METHOD. 

In order to assure a more uniform and perhaps more accurate measurement 
of the subnormal child's scholastic achievement, instead of depending solely on 
standardized tests, new tests were constructed and old ones specially adapted 
for this study. Test items, whenever possible, were based only on material 
which, according to the Heads and teachers, was universally taught in senior 


* A Social Maturity Scale, composed by the writer, and filled out by the teachers, was 
also included in the test battery, the results of which are found in Moran (loc. cit.). 
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E.S.N. schools. Teachers also offered suggestions as to how items should be 
worded in keeping with the subnormal child's paucity of vocabulary. Selecting 
items in this manner gives, to a great extent, a content validity to the tests used, 
in so far as the teachers would not approve the inclusion of items which they felt 
were not suitable for their children. 

In order to mitigate the subnormal's sense of failure, items of tests specially 
constructed for this study, instead of being presented in order of difficulty, were 
arranged so that ‘easy’ and ‘ difficult” ones were interspersed throughout. 
Instead of using tests standardized on a nation-wide school population, the 
norms used here for comparison of the subnormals’ academic achievement were 
calculated from the results of the average children with the same chronological 
age in a secondary modern school, and those of ‘ normal’ children with similar 
mental ages in a junior school. 

It is cogently argued that subnormal children are penalized on verbal 
loaded tests, and that I.Qs. obtained by tests measuring a single modality are 
not reliable measures of an individual’s total mental capacity. To meet this 
criticism, in this study the mental capacity of the children of the experimental 


group is expressed according to their most recent I.Qs. on the Terman-Merrill, 
along with their results on the Progressive Matrices. 


Tests were constructed so as to be given during one class period. A difficult 
test, e.g., arithmetic, was usually followed by one with more appeal, e.g., craft. 
Tests were not given during a rest period, activitity, etc., and if possible neither 
on a Monday nor Friday. 

The size of any one group was limited to twenty, and before commencing 
each sub-test, some time was dedicated to establishing ‘ rapport ' and motivating 
the group. The sub-test papers were distributed separately ; remaining tests 
were kept out of sight and ' retrieved ’ in such a manner as to create anticipation, 
thus arousing interest through curiosity. 

Tests were constructed in such a manner as to require n: di bilit 
items being read aloud by the wri subs: q o: TRACE ed 
largely of semi-literate = illiterate a se a 

The mean Mental Ages of the 8-9 and 10-11 year groups, normal juniors 
(calculated from their converted Progressive Matrices L.Os.) with their respective 
Ee dd on each attainment test, were used as a criterion from which were 
ien s E Mr attainment ages for the different tests. This procedure, 
taal tie dcs Y Schonell and Schonell (1950), is based on the assumption 
by tBé average chil d d € Junior group would be indicative of the work done 
dos leed of 4-11 wick a working class environment, at the chronological 
group. The various telis! in effect, the mental age range of the experimental 
tional age standardiz Bon and | expressed in this way represent a rough educa- 
each test. It also provides r^ p make possible comparisons of each school on 
obtained by the experimental ppreciation of the overall level of attainment 
attainment ages have obvious li o cde ek A pie 
“. . . the mental year, ho s limitations ; though, as Burt (1922) pointed out. 
exact, as it is eminently soraia ude, would seem as a unit, to be sufficiently 
separately. y serviceable.” Boys’ and girls’ test scores were tabulated 


In order to i i ; : 
centage of id Me the dispersion of scores, and to appreciate the per- 
capacity of ‘ doen i experimental group who are working up to the 
School, the median md th dren of the same chronological age in the ordinary 
of the controls in thi € 10th percentile were calculated from the raw scores 

€ secondary modern school on each attainment test. 
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IV.—RESULTS. 
The rank correlation between the mean P.M. I.Qs. and the T.M. I.Qs. of 
the experimental group schools was 0°73, which is reasonable although lower 
n, the author of the Progressive Matrices test. 


than the figure given by Rave [ 
This may be due to three factors : homogeneity of the present group, the low 


‘floor’ of the Matrices which seems to favour the sub-normal child, and the 
fact that the tests measure different mental processes. 


TABLE 1 


ATTAINMENT TEST RESULTS OF EXPERIMENTAL Group (MEAN AGES). 


Total Exp. Group 


=m 


L9 0-00oQq 


Mean C.A. 
Mean PMMA 
Mean TMMA.... 
Reading P 
Arithmetic 

General Information 
Language 

Spelling .-- 

Craft 


1291 00 5 02 01 3 03 


TABLE 2 


AND BELOW THE INDICATED MopERN SCHOOL 


PERCENTAGES ABOVE, BETWEEN 
PERCENTILES. 


Boys 


Above 50th P. | Between 10th and | Below 10th P. 


Subject 
50th P. 


Language 
Reading 
Spelling ... 

Arithmetic. : 
General Information 
Craft 
Soc. Mat. 


28-0 
12-4 
23:8 
48-2 
44-4 
39-5 
50-0 


ibas o 
(b do Óo 4 do s 
Naps 
oZ 
pd R6 


OD 


Language .. 
Reading Ri 
Spelling .... . 
Arithmetic.... eet 
General Informatio! 

Craft 
Soc. Mat. 


one o m N 
ZSRoRwe 
avkwUSE 


The results of the experimental group on the attainment tests expressed in 
mean ages are shown in Table 1, and in Table 2 their results, as compare with 
the controls in the secondary modern school are indicated by percentiles. As 
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TABLE 3 


E.S.N. ScHooLs RANKED ACCORDING TO X 


N P.M.I.Qs. Wiru CORRESPONDING RANKS 


Boys' 
Schools PMIQ PMMA 


AB 
BB 
CB 


= 0 


BIg © 


1 

2 2 
3 4 
4 3 
5 5 
6 8 
7 9 
8 


seen in Table 3 the boys’ schools consistently rank higher than the girls’ in both 
intelligence tests, with one exception on the Terman-Merrill. However, both 
sex groups tend to score higher on the Progressive Matrices than on the Terman- 
Merrill. These findings are Contrary to those of Walton (1955) whose results 
obtained from a group of subnormal children, showed differences in I.Qs., but 
mostly in the direction of lower Matrices results, ~ 


T et esr wide Tange of variability in the level of attainment among 
the, sc ools mn he experimental group. However, according to analyses of 
variance, significant differences between schools occurred only in Arithmetic, 


Reading and General Inform ariations, it seems, may be attributed 
to differences in teachin ; 


enthusiasm for specific subjects in the i ai to elie 
ing erede. e pom. there exists little or no relationship between the rank- 
their ranking on the Pus experimenta] group on either test of intelligence and 
the highest mosione 5 Cees: tests. However, schools which occupied 
in craft: and the bo n intelligence or in the language subjects, tend to score low 
Reg ys’ school ranking highest in craft scored lowest in intellig- 


It is generally beli d i " Ia dH M" ; 
boys are superior to " "es mis girls excel boys in linguistic skills, and that 
proved true i 


In reading we fin 
present study, the me: 
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girls who tend to have lower I.Qs. This superiority of the girls in reading is in 
keeping with the findings of most studies. However, the mean reading score of 
the experimental group was significantly lower than that of the controls with the 
same mental ages. It is interesting to note that the reading test used was 
standardized on a primary school population and that the time limit of 20 
minutes for the age group 6-9 was permitted for the experimental group. If 
mental age is used as a standard, there seems to exist a wide discrepancy 
between reading attainment and mental capacity in the experimental group. 
This is in contrast to the findings of Wilson, F. T. (1926), Bensberg (1953) and 
MacIntyre (1937). As seen in Table 3, there exists little relation between the 

rank orders in reading and intelligence in the experimental group. 
The results of the experimental group in spelling in the present study are 
more in keeping with those of Burt (1922), and Sleight (1932), who found the 
d exceptionally inefficient, than with those of Merrill (1922) 


subnormal chil r í 2 
Wilson (1926) and Gordon (1923) who give evidence of these children’s relative 
red to their mental ages. This can be seen in Table 1. 


efficiency as compa! 
imental group were significantly superior to the girls 


The boys of the expe 1 
in arithmetic, a fact generally accepted if not satisfactorily explained. However, 
the total mean arithmetic score of the experimental group is lower than that of 


the junior group. It must be remembered that the items of the arithmetic 
test were adapted so that most problems involved money transactions, which is 
in accordance with these children’s day-to-day experiences. Thus, the experi- 
mental group’s results in arithmetic in this study are contrary to those found in 
others (Wilson, op. cit., Merrill, op. cit., Bensberg, op. cit., Patterson and 
Philleo, op. sit.), in which subnormals' scores in arithmetic equate or are above 

their mental ages. ed 
The boys' score in the experimental group on general information 1s mark- 
edly superior to that of the girls. This superiority of boys over girls in general 
lligence is in keeping with the findings of other 


information at all levels of inte 
1939). However, as is shown in Table 1, the 


studies (Terman, 1926, Inman, !9- ) [ ] 
children of the junior school obtained a higher mean general information age 


than the adolescents of the experimental group whose average total life experi- 
ence is 5 years 4 months greater. Nevertheless, many children of the experi- 
mental group showed a relatively wide range of general knowledge, which, in 
many instances exceeded that expected of them by their teachers. Further 
enquiry by the writer revealed that many of these children are capable of recall- 
ing and associating facts when these are presented through visuo-auditory media 
such as T.V. and newsreels. 


The boys' and girls' craft tests were completely different in content, and 


therefore, the scores of the two sex groups were tabulated separately. The mean 
craft age of the boys of the experimental group is higher than that of the junior 
school and approximates the mean score of the secondary modern school. 
In fact, one school of the experimental group, the one with the lowest mean 
1.Q. on both intelligence tests, exceeds the mean score of the secondary modern 
school. The mean craft age of the girls in the experimental group is also higher 
that that of the girls in the junior school However, the E.S.N. girls’ mean 
craft age is significantly below that of the girls in the secondary modern school. 
The mean ages of both sex groups is higher in craft than in any other subject 
tested. However, a scrutiny of individual school results reveals that, in craft, 
E.S.N. schools tend to rank in almost direct inverse order to both their P.M. and 
T.M. mental age rankings. This can be seen in Table 3. 

Table 4 shows the results (per cent. correct) of the exercics' 
le task was to write legibly and correctly his name 


e in which the 
examinee's $0 (Christian and 
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surname), date of birth (day, month, year), present age, age next birthday, his 
present address, and the name of his schools. As can be seen, the results of the 
children of the junior school in the age group 10-11 are comparable to those of 
the A and B streams of the secondary modern school, and are significantly higher 
than those of the adolescents in the experimental group. There is a slight sex 


difference in favour of the girls of the experimental group in all parts of this 
exercise. 


TABLE 4 


RESULTS oF JUNIOR, SECONDARY MopznN AND E.S.N. SCHOOLS ON QUESTIONNAIRE. 


School Group or Sex Date of 


Present Name of 
Birth E Address School 


Yo 
9 
d 
7 

10 

8 


Total 


0, 
2-0 
52-0 

2-0 

0-0 

1-8 
90-9 
70-1 
77:9 
74-0 


Table 5 gives an overall comparison of the mean results of the experimental 
group on the spelling, reading 


e and arithmetic tests used in this study with the 
results obtained by the E.S.N. schools themselves, using Burt's or Schonell's 
standardized attainment tests, together with the results cited earlier by Burt. 
It will be seen that the findings are generally quite similar. 


TABLE 5 


Comparison OF VARIOUS ASSESSMENTS oF AnBiLITIES OF E.S.N. PupiL.s (MEAN AGES). 


Subject Burt's Stud 7 
M 
(1919) y Moran (1958) School 


Assessment? 


Spelling 
Reading . 
Arithmetic... 


:5 7:3 
3 
Handwork 


229 


Tuo 


LP (Terman-Merrill) 


5 
Average C.A, : 7 (Progressive Matrices) 


Average I.Q. 


Nore 
BRAS œ 


‘7: (Terman-Merrill) 
‘7 (Progressive Matrices) 


1 See Burt (1922, p. 337). 


? Mean Ages calculated from rı It: 2 3 ; 
schools (Burt or Schonell), esults of most recent attainment tests given in E.S.N. 


E. 


T 
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V.— Discussion. 


The results of the experimental group, as compared with those of the junior 
and secondary modern school, strongly suggest that a small percentage of 
adolescents of limited intelligence are capable of and are actually performing at a 
level of scholastic achievement comparable to that of children with similar 
mental ages in a junior school or in a‘ C’ steam of a secondary modern school 
as claimed by Parker, et al (op. cit.); that there exists an undetermined 
number of children who are potentially capable of working at a higher level, 
but who may not be doing so, due to factors other than that of intelligence ; 
and that, under the present administrative system and school curriculum, the 
majority of the subnormal adolescents approaching school leaving age tend to 
score lower, on the attainment tests used (with the exception of craft) than 
children in the junior school with similar mental ages, and residing in a similar 
socio-economic environment. In other words, their average level of scholastic 
achievement is equivalent to that of a second form in the ordinary junior 
school, as was originally claimed by Burt (1922). 

However, it must be emphasized that the mental ages of E.S.N. children 
should be considered only as an indicator, and not as a limit of academic poten- 
tial. Moreover, the type of test used makes a considerable difference, e.g., 
individual, group, verbal, non-verbal, etc., For it was observed in the present 
study that an M.A. calculated from a verbally loaded test will generally be lower 
than if calculated from a test with a high ‘g, ' k ' or perceptual loading; and 
that the subnormal child's attainments tend to approximate more closely to 
this verbal M.A. On non-verbal tests, subnormals tend to score higher, thus 

between mental age and academic achievement 


increasing the discrepanc i 
the discrepancy between the findings of the present 


This might explain, in part, the disc icy b e 2 € 
study and those of other similar investigations, 1n which, in most instances 


verbally-loaded tests were used, and in which the attainment of subnormals 
equated or surpassed their M.As. It is observed, however, that in the present 
study, the results of the experimental group are more in keeping with those of 
English investigators than with those of American. Thus, one may hazard the 
hypothesis that the closer approximation of subnormal children’s attainments 
to their mental capacity jn American studies than in English ones may be due 

dard of work at the various age or grade levels in the 


evidence supporting this hypothesis is presented in a 
Pidgeon (1958). However, the subnormal's relative 
ks involving spatial or perceptual rather than verbal 
factors, may indicate a need for re-organizing the present E.S.N. curriculum, 
and re-examining the present philosophy and practice of education, n which 
emphasis is still placed on those disciplines in which the subnormal's scholastic 
success depends largely on his competence in handling verbal material Thus, 
it seems logical that the curriculum of secondary special schools or classes for 
subnormal children should be based on those mental processes in which experi- 
mentation has shown these children to be relatively more competent, i.e., 
according to their mental assets rather than to their weaknesses. A lead in this 
direction was taken by Duncan (1942) in England and Winterbourn, R., 
(1944) in New Zealand, who have claimed favourable results. Unfortunately. 
these claims have not been substantiated by other workers. Nevertheless, until 
these mental functions have been scientifically determined, the E.S.N. school 
curriculum could be planned around a programme, incorporatiug the necessary 
basic and craft subjects, but mainly directed towards inculcating attitudes and 
habits necessary for vocational and social competency. Such a programme would 
be reinforced by continual and varied experiences in the area covered in the 


two countries. Some 
comparative study by 
ability to deal with tas 
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i i isi dium 
al information test. Television could be used as a supplementary medi 
i siret by which the subnormal child could be provided with information 
in a form in keeping with his relatively superior visuo-perceptual ability. 


r whole-hearted co-operation ; the Central Research 
Fund of the University of London Which helped defray the cost of test materjal ; 


and Professor P. E. Vernon, University of London, under whose supervision and 
guidance this study was carried out. 
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OBSERVATIONS ON RETEST DISCREPANCY IN 
MENTALLY SUB-NORMAL CHILDREN 


By D. H. STOTT 
(Psychology Department, University of Glasgow.) 


Summary. Retest scores of subnormal children were examined in order to 
check the reliability of previous retests and to estimate to what extent the 
discrepancies might reflect real changes in the rate of mental development as 


well as errors of standardization. 
The discrepancies were considerably greater than those obtained from the 


standardization of the Binet by Terman and Merrill. The usual tendency 
towards increasing discrepancy with time-interval was confirmed, except for an 
apparent practice-effect at short intervals. The gains and losses were much 
more heterogeneous than could be accounted for by intra-test error. Boys 


lost significantly less than girls. 
It was concluded that the retest discrepancies might in large part be due 


to real variations in the rate of mental development (including such as are 
dependent on changes in motivation). 


[.—ScoPE AND PROBLEMS OF THE ENQUIRY. 


Ir was observed in a limited number of special schools for educationally sub- 
normal children (not included in the survey reported below) that I.Qs. on retest 
often showed greater fluctuation than that considered the rule for children of 
this type (Terman and Merrill reporting an average difference between scores 
on the L and M forms for the below-70’s of only 2} points). The present enquiry 
is an attempt to assess the generality of this observation and the extent to which 
the discrepancies on retest may reflect real changes of cognitive or conative 
development. 

It is first necessary to consider causes of discrepancy which arise from the 
procedure of testing or from the nature of the tests. The chief of these are 
probably (a) the varying ability of different testers to secure rapport; (b) the 
use of different tests; (c) test-sophistication ; and (d) faulty test-standardiza- 
tion, especially in respect of subnormal children of school age. 

Each of the above factors is examined below, but the fourth involves some 
theoretical points better mentioned at this stage. Discussing the deviations from 
variance-constancy found in the standardization of the Terman-Merrill revision 
of the Stanford-Binet, upon which their Correction Table is based, Fraser 
Roberts and Mellone (1952) write : “ It is most unlikely that the variations in 
variance have any particular meaning in terms of real mental growth; they are, 
doubtless, due to the way the scale is constructed and the waythe measurements 
are used." They give no reason for this conclusion, although they quote 
McNemar (1942) as pointing out that the 6-year level of the test has insufficient 
items of medium difficulty, while the 12-year level has too many. 

It cannot, however, be taken for granted, as do Roberts and Mellone, that 
all observed variations of variance as between one-year-level and another are 
due to faulty standardization. They may also be caused, wholly or partly, by 
real variations án mental growth as between sub-and supra-normals. The 
latter hypothesis is the more recent one, and has been rendered feasible by the 
general recognition of large environmental influences in mental development and 
of the contemporary nature of mental measurement (Vernon, 1959). It is 
represented by Clarke and Clarke (1958), who take up what must be considered 
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prima facie a biologically reasonable position : “ During childhood and adoles- 
cence, mental (like physical) growth does not proceed at a uniform rate and the 
individual’s position with respect to others in his group tends to vary from year 
to year, this being reflected in changes in L.Q." In support of this view, they 
refer to an estimate, based on Thorndike's summary mentioned below, that 
correlation between test and retest decreases on average by 0-04 per year. Since, 
they argue, errors of measurement are unlikely to be cumulative, “ this must be 
a reflection of real change in intellectual level due to different rates of development," 
(their italics). However, they do not refer to the alternative possibility that 
discrepancies widening over certain age-ranges may arise from faulty test stand- 
ardisation. They quote certain studies which report large changes in I.Q. Of 
these, that of Honzik, MacFarlane and Allan (1948) is the most spectacular ; 
but, as the Clarkes point out, the I.O. differences represent the greatest diverg- 
ences registered by each individual between the ages of 6 and 18 years, which 
not only maximises accidents of the test-situation, but eliminates a number of 
lesser variations and identical retest scores which would give a more balanced 
picture. Vernon also points out (1955) that the children who were the subject 
of this research were of superior ability, and the Clarkes agree that this would 
give a great range for variation over a large number of years. The Clarkes' own 
study (1953) is also relevant: they found, in the case of over 100 adult mental 
defectives, that those who came into a mental institution from very bad home 
environments showed a mean rise of 10 I.Q. points two years after admission, 
compared with those from less adverse environments, who only showed the 
equivalent of the normal effects of test-practice. 


Thorndike (1933) made a rough average of a number of studies which had 
found increasing discrepancy with le. 


n 1 1 ngth of time between tests: whereas the 
correlation for an immediate retest fell close to -90, for a test after a 5-year 
Se it Hes estimated to be about -70, giving an average annual decline of 
f a 2 i ater grow of work on I.Q. constancy, Thorndike (1940) quoted a 
study by Brown (1933), which reported a drop of retest correlation from :86 after 
an interval of less than two years to -61 after five years or more ; and a study by 
Hirsch, reanalysed by Miller (1933), which reported a progressive increase in 
the median difference in L.Q. from 5.4 at one year interval to 7-3 poiuts at 
five-years. It seems, in summary, well established that divergences on retest 
tend B be pa ei ee A s interval between the tests. 

e present enquiry is based on the assumption that if retest discrepancies 
ett B. eme of change in the rate of theta growth, they will shar two 
joe a T a they will be heterogeneous ; that is to say, even though 
aki a cB d : uk DR upwards or downwards, there will be a sufficient 
of thosein (ho diem ro iia n Suggest significant individual variation, and 
tendency Jor the Peso: €t trend the variance will belarge. Second, the 


1 l neies to be greater the longer the interval between 
testings should ideally also apply to those against the fet trend. 


é II.—DESIGN or THE EXPERIMENT. 
arrangement wi i iati 
B. o e E at ber the Special Schools Association, head-teachers of day 


mange 50-15) aver the m sub-normal children (of I.Q. largely within the 
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should be continuous series, and not examples of divergence or otherwise. The 
pro forma asked for the ages of the children at each testing, the test used, and 
who tested. 

Data were obtained for 952 c 
tested three, four or five times. 
compare each test-result with each ot 


hildren. Some of these, however, had been 
In these cases the logical procedure was to 
her, so that three testings yielded three 
comparisons, four six and five ten. The total of comparisons was reached 
1358, Of these, both test and retest were made by the same tester in 830 cases, 
and by different testers in 528, All were the appointed examiners of local 
education authorities, namely educational psychologists or medical officers. 
Classification by length of interval between tests had to be arbitrary, but it was 
thought best to make sure of including within the shortest interval what were 
intended to be annual retests. Consequently, to allow for the possibility of these 
being made during the fourth school term, that is, a term in arrear, the shortest 
interval was fixed at 1 year 4 months. The remainder were fixed as follows : 
1 year 5 months to 2 years 4 months; 2 years 5 months to 3 years 4 mouths ; 
and 3 years 5 months or more. No trial was made of any alternative division, 
so that the above may or may not approximate to the one which would bring 


out the greatest interval-differences. 


III.—FINDINGS. 
The distribution of the total retest discrepancies is shown in Figure I. 
28-0 per cent. were of 10 1.Q. points or more; 11-1 per cent. of 15 points or 
more; and 3:5 per cent. of 20 or more. In the table which Vernon (1957) 
publishes in Secondary School Selection (Appendix B), for 29 per cent. of I.Qs. 
to change 10 points or more, and 3 per cent. to change 20 or more corresponds 
an and Merrill claimed a reliability of -98 for the 


to a correlation of -80. Term i 
below-70's and :95 for the 70-89 group. There are, however, several possible 


explanations of this disparity. The examiners for the T-M standardization were 
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working self-consciously upon a project, whereas the testing in the present 
sample was done in the ordinary course of administrative ascertainment and 
supervision. The intervals between retests were shorter in the standardization, 
so that there would have been a greater tendency for a mean rise from practice- 
effect to cancel deterioration of performance by subnormals. Lastly, it can be 
calculated from the distribution graph (Lerman and Merrill, p. 37) that their 
sample covered only about 144 children of all ages of I.Q. below 75, that is, only 
about one-tenth of the present sample. 

The mean discrepancy, disregarding direction, was 7-098 points. It is 
seen that there is a consistent displacement to the ‘ minus ’ side (falls in I.Q.). 
If the fifty retests which registered no change are allocated equally between the 
rises and falls, the mean of the rises is 6-512 and the mean of the falls 7-449 ; but 
since there were nearly twice as many falls as rises the net mean discrepancy, 
taking account of direction, was —2-194. Despite this general tendency, 
consistent with the findings of Roberts and Mellone, for the I.Os. of subnormal 
children of school-age to fall, there is a sufficient heterogeneity of retest dis- 
crepancy to render feasible the hypothesis that the mental capacity of a signi- 
ficant proportion of the children changed significantly. For the present argument 
a discrepancy of plus or minus 10 points may be taken as an arbitrary criterion 
of significant change (for whatever reason). Assuming that some systematic 
Correction is needed on account of faulty test-standardization, the above will 
have to be adjusted to allow for the net mean fall of just over 2 points, giving 
as rough criteria of significant change gains of 8 and falls of 12 points. On this 
basis there were 166 significant rises and 178 fa'ls, which combined amounted to 
25.3 per cent. of the test-retest comparisons. Reference to the Roberts- 
Mellone Correction Table shows that for subnormal children the only gains 
needing correction occur in the age range of 5 : 0—7 : 11, and the corrections 
recommended are deductions of | to 4 points from 5: 0—6 : 11 and 0 to 3 for 
T 5L i1. In the present study 16-3 per cent. of the first tests in the total of 
retest comparisons were made when the children were in the former of these age- 
groups and 16-4 per cent. when they were in the latter. Thus, while the falls 
could be partly accounted for by errors of standardization, the gains could not 
be so, since either they ran counter to the recommended corrections or if in the 
same direction were not of the same degree. The heterogeneity cannot, there- 
fore, in the main be due to faulty standardization, whatever this may mean. 

The other feasible causes of retest discrepancy must be evaluated at this 
"ei The test used was almost exclusively the Terman-Merrill revision of the 
DIE IUE (the few others being one cach on the Porteus and Alexander 
ud | geom ante five unknown), So that the use of different tests as a reason for 
is afc can be ruled out. Where, however, the children were tested 

y à iHlerent tester there is a tendency to greater variability. The mean dis- 
crepancy with the same tester was 6-80, and with different one 7-49. This is 
— at between the 2 per cent. and 5 per cent. risk of chance ( =2-243). 

" hup mu C. amne made below from the tendency of the discrepancies 
f n8t of retest interval, the Possibility must be considered that 
the latter might in part be due to the greater probability of a different tester 
per E "mas ie € In fact, as Table 1 shows, the greater over-all 
mea j r different tester ' e entirely withir tor inter? 
(upto 2 : 4 years). At the longer ole e tene ee 
tester ' were actually slightly less than for ' same tester.’ 

This disposes of the possibility that the change of tester might have produced 
the widening discrepancies over the longer intervals. It can also be said that 
while the human factor of the tester has some influence it isless than might have 
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TABLE 1 


Mean Retest Discrepancy (I.Q. Pornts). 


Same Tester Different Tester 


Retest Interval : 
5-495 6-766 


7:628 


6-631 
8-598 


been thought, and certainly not the main reason for the retest discrepancies 
observed. 

The above-mentioned tendency for the test results to be more discrepant 
over the shorter intervals only with different testers seems an anomalous one. 
It may possibly be explained by test-sophistication. Elwood (1952) found that 
retests in which the same form of the Terman-Merrill scale were used gave 
slightly higher I.Qs. than those with the other form. It is also possible that the 
mean rise in I.Q. which Rudolf (1949), using the Wechsler scales, found in 
institutionalised defectives after intervals varying from six to eighteen months 
was also due to test-sophisticatton. 

In the administrative situation in which the presen 
out, the same tester retesting a child after a comparativel 


normally take the precaution of using the alternative form ; another tester 
would probably have no record of which one was used for the earlier test. Thus, 
it is possible to hypothesise à practice-effect, or more accurately, a memory- 
effect, when subnormal children are confronted with the identical version of the 
Terman-Merrill up to intervals of 2 years or so. If this is well-founded, the 
discrepancies in question would be predominantly increases, despite the over- 
all tendency to decreases. Table 2 shows that there is, indeed, a tendency in this 
direction : the only cells in which the plus discrepancies have higher means 
than the minus of the same retest interval are in the shortest interval for 
“same tester’ and the two shortest for ‘ different tester.’ Nevertheless, this 


t testings were carried 
y short interval would 


TABLE 2 


Mean Pius AND MINUS DISCREPANCIES BY TESTER AND INTERVAL. 


Same Tester Different Tester 


Retest interval : 
0-1:4 years... 


:5—2 -4 years 


: 5—3 - 4 years 


:5 years or over +- 
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defiance of the fall-tendency is not quite strong enough tobe significant in terms 
of mean differences. In the different-tester group for the two shorter intervals, 
the mean of the +’s is 7-698 and of the —'s 6-750, giving a of 1-467. However, 
if the number of cases in which the I.Q. rose is compared with the number of 
falls, irrespective of amount of either, the preponderance of rises within the 


two shorter intervals of the different-tester group is significant (Table 3) at 
the 0-1 per cent. level. 


TABLE 3 


NUMBER or Rises AND FALLS IN LQ. IN DIFFERENT-TESTER GROUP. 


Two shorter intervals 


Two longer intervals 


Expected incidence in brackets 
X°=13-89 p<-001 


In general, therefore, the h: 
seem to be borne out. On the 
crepancies to be greater with th 


ypothesis of a degree of test-sophistication would 
bed hand, since the tendency for the plus dis- 
$ à € shorter intervals of the same-tester group is so 
slight, this factor of test-sophisticatio i i - 
icenisof thedata below n may be discounted in the further treat 


The tendency for the retest discre ancies to increase wi i can 
be best studied, therefore, within the cii group of dee eem are 
sufficient cases (830) for the different-tester group to be dispensed with. 

In Table 4 the mean discrepancies within every interval are compared with 
every other. It is seen that there is a significant enlargement in the retest dis- 
crepancy as between every shorter and every longer interval, except for the 


shortest with the next shortest, and even thi j igni 
: uy s only just falls short of significance. 
Doe c work has shown, the longer the retest interval the greater the 


Such a trend is consistent with the hypothesis that subnormal children are 


bject t i F E ^ 
Sd he eiie T T mental growth, It is doubtful if such consistency 


correct test-standardizati Even if it the 
enlargements of the discrepancies t mult ndn pteeeeoreieete 
ETE oic kere Pis e produced would be in one direction only, 
besides the marked one in 
A further means of tes 


Š In casework among backward chidlren of school age 
pi oe B boys acquiring new mental resources from the beginning © 
B he c in part E fos most marked where the mental retardation seemet 

i : ype of congenital i irment whic 
P ied vec Vnforthcomingness (Stott, 1959, a, by "This is reneenicel by 
Ser : e name of Antriebsmangel (lack ©: 
motivation), and is thought Bapo tetekim anis ts D sb 
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TABLE 4 


INCREASE IN Retest DISCREPANCY WITH Tzsr-INrERVAL (SAME TESTER). 


Test interval 3:5 or more 


(a) | 3:421 
(b ^ 0-552 
1—6-197 
p«-001 


(a) 3.386 
(b) 0-483 


t—7-011 


(a) Difference in mean discrepancy. 
(b) Standard error of same. 


characteristic is a lack of the assertiveness which prompts a normal child to face 
strange situations and tasks, so that the unforthcoming child appears as à 
‘mousy,’ confidence-lacking type (Stott, 1956, 1958). It is notable that boys 
of this type, but not in the writer's limited experience girls, tend during adoles- 
cence to acquire the assertiveness and confidence needed for tackling reading ; 
and it is reasonable to suppose that they would also be able to face the tasks set 
byan intelligence-test with greater confidence and hence with a greater chance of 
success. k 

There thus emerged the possibility of a sex-difference in rate of mental 


development. To test this it was necessary to approach once again the heads of 
had supplied the original data in order to find out 


the thirty-four schools who hac . é 
the sexes of the children. This information was obtained for all but four schools, 


in which there had been a change of head or the children could not be identified. 
In one of these schools the sex of two children—boys with L.Q. losses of 12 and 
15—was identifiable, and these were included, since they told against the 
hypothesis of sex-difference- The numbers of test-retest comparisons in respect 
of children for whom the sexes were known were 632 for boys and 643 for girls, 
giving a total of 1,276 compared with the original one of 1,358. The verification 
of a sex-difference required here, not the ascertainment of larger or smaller 
discrepancies as between boys and girls, but an over-all tendency for boys to 
gain more frequently, or lose less so, than girls. It was found that the boys 
suffered a net loss of only 1-189, compared with 2-985 for the girls. With 
t—3-110, this was significant at less than 1 in a 1,000 risk of chance. The sex- 
difference thus established must be held to be evidence of differing individual 
rates of mental development, for it is hardly conceivable that this could be due 
to intra-test factors. A corollary is that, if there remains any logical justification 
for a correction table, there should be different ones for each sex. 
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IV.—Concrusions. 


The tendencies for retest discrepancies to be heterogeneous, to enlarge with 
the length of the interval between tests, and for backward boys to lose signific- 
antly less on retest than backward girls, suggest that the observed variations in 
variance of the Binet tests are due in large part to real changes in the rate of 
mental development of subnormal children of school-age. This is not to deny 
that part of the variation is also due to faulty standardisation. Close study of 
the percentages of children of various ages passing each age-level of the test, 
as given in McNemar’s (1942) Table 26, reveal obvious inconsistencies of stand- 
ardization, and it would be surprising if these were not reflected in distortions 
of variance. For example, the 5-year level is much too easy for children of 
5 years and over—presumably also for subnormals for whom it is liminal— 
while it is much too hard for children under 5 years. The 9-year level is again 
too hard for children below that age. Nevertheless, the utility of a correction- 
table is doubtful, in view of the difficulty of isolating the influence of faulty 
standardization from real accelerations or decelerations of mental development. 
In using such a table there is a danger that real changes of mental growth may 
be concealed. 2 

It may also be asked whether faulty standardisation has any meaning 
except in the sense of a faulty methodology, such as reliance upon too small 
samples of subjects. Cannot the curious trend of the mean percentages of 
normal children of different ages who pass the 5-year level, referred to above, 


be due to a jump in the intellectual development of children between 4 and 5 
years? It may, indeed, be virtually im; 


not change for natural reasons. B 
ability of any individual is the end- 
interaction, one would expect a widenin 
not-so-well mentally equipped, and bi 
favoured. If we ignore this, and ' st 


may cover up this or other natural changes 
€ct be neatly, but falsely, standardized by 
eak or the environmentally less favoured. 
to adopt a more empirical approach to 


Finally, it cannot be assumed that s 


i n , ability to di 
time at an intelligence-test—. A N 


sent changes in moti ti diff d ; 
of mental organisation and eoten et otivation, or a different degree 
of mental operations (Piaget. 1951) ation resulting from changes in the range 
in motivation. 
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REFLECTIONS ON RESEARCH ON ALLOCATION TO 
SECONDARY EDUCATION 


By R. R. DALE 
(University College of Swansea.) 


Summary. Research on allocation at 11+ tends to ignore some aspects of the 
Situation. The concept of failure in the grammar school is too imprecise. When 
correlations are made more attention should be paid to the distribution of 
movement between the variables. Prediction research should take account of 
the difference between the performance of boys and girls in the grammar 
school, and between girls’, boys’ and mixed schools. Low correlations may be 
partly due to bad educational guidance. Streaming in the grammar school 
leads to a crystallisation of differences and an artificially high correlation. 


Prediction research might investigate the possible existence of different 
subject arrays for different social classes. 


L—THE CONCEPT OF FAILURE IN THE GRAMMAR SCHOOL. 


THE writer has been working for some years on problems relating to the selection 
of university students, selection of teachers, and various aspects of co-education 
at the secondary level. Here and there this work enables one to look at research 


on allocation at 11-4 from a slightly new angle, and to make various tentative 
suggestions. 

There is a large comparative element 
grammar school. It is natural for the tea 
class, say, half of its pupils as a 


quarter as inferior and ' failures.’ Now even if selection were perfect, resulting 


in the assessment of failure at the 


tesearch concerns itself more specifically 
of merit, its findings might be couched i 
this failure phenomenon, The lack of 
grammar school is, of course, a handic 


As ‘an extension of this section, 


; nd the standard so high that there are no 
failures (e.g., at Ordinary level): also £ lati the 
standard so low that all pupils fail, O hon elation to be perfect and 
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that the movement between the two variables may be relatively more con- 
centrated either in the upper or the lower part of the distribution, thus causing 
a discrepancy between the number of misfits, calculated from the correlation 
co-efficient taken at its face value, and the number actually occurring. Such 
concentration of movement is, of course, more likely at the bottom end of the 
grammar school allocation owing to the greater homogeneity of this border zone 
group. Second, the extent to which the correlation departs from perfection 
may be brought about, mathematically speaking, in a number of ways—by a 
large number of small movements between the variables, by a much smaller 
number of big movements, or by a mixture of these ; the number of misfits 
will depend upon which of these operations is occurring. 

Research might also make further investigations into the influence of 
major adventitious occurrences On prediction, e.g., prolonged illness and 
absence, because there is an unconscious tendency for us to blame the allocation 
examination for failing to diagnose brain tumours, parental divorce, and the 
deaths of pupils' mothers. It appears to be a tenable argument that if one is 
assessing the power of the allocation examination as an academic measure one 
ought not to compare it with perfect correlation, but first deduct from the 
latter that decrease due to adventitious circumstances. 


IIL—PREDICTION BY SEXES. 


Many of the allocation studies have made no distinction between the sexes, 
and have made one predictive estimate for both combined. This would be 


valid only if it could be shown that they make equal progress in the principal 
subjects and if their overall performance at the criterion stage were virtually 
Unfortunatly, however, this is not so. Girls in girls’ schools fare 
appreciably better than boys in boys’ schools at the Ordinary level of the 
General Certificate of Education. Nor is this superiority spread out evenly over 
all subjects ; in mathematics and physics there is even a feminine inferiority, 
though this is not immediately apparent when the percentages of passes are 
compared. (Sutherland, 1959). The case, therefore, for separate prediction 


systems for boys and girls should be examined experimentally. To calculate 
joint correlation co-efficients, often with unequal numbers of the sexes in the 
Even to calculate these co- 


experiment, is scarcely t 
the sexes and then combine them is not valid at the 


down, producing à lower corre. 1 
For some individual subjects such as mathematics a: 
differences become so pronounced that research wor 
warily ; here the situation js also bedevilled by differences in educational guid- 


ance, as we shall see in the next section. 


III.—PREDICTION AND EDUCATIONAL GUIDANCE. 

always what they seem. Part of this feminine superiority 

in the Ordinary Jevel examination is due to the predominance in the grammar 

school curriculum of subjects which have a large literary element and involve 
to subjects such as mathematics (most 


considerable memorisation, as opposed ] 1 
parts) and physics, in which the ability to reason in mathematical terms 1$ 50 


Nor are things 
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important. This is a form of educational guidance which favours the girls and 
is even now being modified by the increasing emphasis on science. Again, part 
of this apparent superiority is also due to marked differences in educational 
guidance within girls’ schools as compared with boys’ schools. A high propor- 
tion of girls, particularly the weaker ones, are permitted to drop mathematics, 
physics and chemistry at the Ordinary level (and often well before) and take 
biology instead, which better suits their abilities and interests. The correspond- 
ing movement in the boys’ schools, i.e., the dropping of literary subjects by the 
weaker boys, has made much less progress. There is also a small age difference 
in favour of the boys. We are not concerned at the moment with the rightness 


or wrongness of this guidance, but with an attempt to clarify the position for the 
research worker. 


O-called misfits are created by bad educational 
: guidance. There isa tendency for 
cience ’ courses because their friends are 
t ather wishes it, etc. It has even been known 
for the bias of the headmaster to cause a similar pressure. Clearly, it is wrong to 
T nination for faults in guidance which cause a student 
to fail. In order to put things in the right perspective, it must be acknowledged 
nd good guidance. 


IV.—Prepicrion AND SEX OF SCHOOL. 
In the present educational situatio; 


He E ex of its pupils. If we take the Ordin 
level examination results of maintained grammar cols as a criterion, without 
a ex taking each subject, 

Thisb or in almost all subjects (Tyson 1930, King 1949, 
. s by no means necessarily indicates any innate differences 
or grammar school work, but it does 


Superior to those of a boy ina boys’ sch GIAN pigris school ate 


ool. 


OyS' schools, while the girls are slightly below 
he girls in Tyson 193 vs 1925 159), 

girls’ attainment in ces ed is " m pk iei ite peel 
this rule (Cameron 1923). If social 


but we can say with some certainty 


ri ics i i i i 
EOS mathematics In a mixed school than in a girls’ 
, Verage tendencies and may be reversed in 
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individual schools by the presence of a good teacher in the girls’ school and a 
poor one in the mixed school. 


V.—PREDICTION AND THE CRYSTALLISATION OF DIFFERENCES. 


This concept is best understood through an actual example. Let us suppose 
i 11- examination have been allocated to the one 


that the top 120 boys in an 
local boys’ grammar school. It is a usual custom for a school to place these 
boys in order of merit in Forms A, B, C, and D. This procedure in itself will 
create a crystallisation of the difference between the top thirty in Form A and 
the next thirty in Form B, and so on. In other words, it is a force working 
against any reduction in the correlation coefficient between the allocation 
examination and any first, second—or even third year, criterion in the grammar 
school. Though free movement of order of merit is possible within each form, 
this movement is impeded (though not prevented) between forms. In addition, 
since the average level of academic aptitude will be highest in Form A, these 
pupils will work at the quickest pace and the difference in attainment between 
them and the pupils in Form B will be increased, and so on down to Form D. 
Again, this will tend to maintain prediction at a high levelif calculated by means 
of correlation coefficients. The authorities do not say : “ We have predicted 
the nature of your grammar school performance and now we are going to make 
sure that we are right," but in effect this is what they do without knowing it. 
It is rather like a specialist in hospital who took great pains to diagnose the 
diseases of groups of patients and later went round with germ-laden hypodermic 
syringes to make sure his diagnoses came true. 42 m. 
This argument, therefore, casts some doubt on the validity of estimating 
the number of grammar school misfits by correlating the 11-+ examination 
with grammar school criteria a' e fir md and even third year. 
As most readers will know, 2 somewhat similar phenomenon 1S frequently 
observed in secondary modern pupils compared with secondary grammar pupils. 


VI.—PREDICTION AND SOCIAL CLASS. 

between overall prediction 
ion of the possible exist- 
ence of different patterns o for different social classes in the subject 
nia ' For wer e, the prediction of success for the children of unskilled 
labourers may i in English language from that in mathematics. 
Tf this is so, the neglect of this factor, or of the sex factor, may have caused 
some of the slight discrepancies in research findings for which no explanation has 
been put forward. 


VII.—THE CRITERION. 
en the allocation examination 


ich they themselves choose or arrange. They 


prediction. But is it not possible that it is in part the criterion which is faulty— 
that neither its reliability nor its validity are perfect? If this lack of perfection 
Jationship with the allocation examination 


prings the criterion into a closer relations? 0 
than the true one would have done, jt will cause a spurious decrease in the 


number of misfits ; ifit moves in the opposite direction it will cause 2 rise. 
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PERSONALITY ASSESSMENTS AND ACADEMIC 
PERFORMANCE IN A BOYS’ GRAMMAR SCHOOL 


By E. ASTINGTON 


(Senior Me odern Languages Master, Stockport School.) 


Summary. This study has as its origin the observation that over 40 per cent. 
of grammar school candidates for the G.C.E. of the N.U.J.M.B. pass in fewer 
than three subjects. The hypothesis is considered that this unsatisfactory 
position might be improved by taking into account in 11-+ selection those 
hich make for academic success. 

tempting to determine the personality qualities 
demic success or failure, and after some preliminary experi- 
ded to use teachers’ ratings, class-mates' ratings and a 


300 boys in five primary schools in a county 
i ce, independence, 


borough were rated by their c 
dominance, in s and emotional stability. Some 700 boys in 


the maintained boys' grammar school in the same town were assessed in the 
same years for the same six qualities by the ratings of three teachers, class- 
mates’ ratings and a questionnaire ; in addition, introversion-extraversion and 
sessed by the questionnaire and sociability also by class 


sociability were as 
mates' ratings. 
At all levels, successful boys received significantly higher ratings than 
tence, independence and interest ; dominance seemed 
vement ; successful boys 


unsuccessful boys for persistent d j ; 
to have no consistent connection with academic achie’ 

emotionally 
verted 


showed a slight tendency to be considered more nervous but more 
stable ; successful older boys tended to consider themselves less extra’ 


and less sociable, than their unsuccessful fellows. 
Grammar school teachers’ ratings of persistence, independence and interest 
showed an average correlation of about -60 with performance in internal school 
examinations and with G.C.E. achievement. They differentiated significantly 
between groups of G.C.E. candidates, matched on I.Q., but of contrasting G.C.E. 
Ratings made in the second year of the grammar school predicted 

1.Q., but less well than second-year examina- 


performance. 
G.C.E. performance better that 
tion performance. -— . E . 
Primary school teachers' ratings of persistance, independence and interest 
with the boys' 


lation, which increased with time, 
school achievement. These personality ratings showed 
higher correlations with G.C.E. performance five years later than did 1.Q., but 
were less good predictors than performance 1n the 11+ examination. On this 
evidence, the use of such ratings might be considered as an element in 11+ 
selection, but they ought not to be used as a substitute for the 11+ examination. 


jw corre 


{ENTS AND PERFORMANCE IN SCHOOL EXAMINATIONS. 


In 1953-4 and 1954-5, boys of Stockport School, a maintained boys’ grammar 
school, were assessed for different aspects of personality in three different ways : 
by teachers’ ratings, by class-mates’ ratings and by a questionnaire, a form of 


self-rating. 

In 1953-4, 678 boys, in twenty-six different forms, ranging from the first 

year to the lower sixth, were rated by thirty different masters, on a five-point 
i 4-2 to —2 for each quality, details of which will 


descriptive scale ranging from 
be found in the Appendix. Each boy was rated by three masters who were 


teaching him in that year, so that each boy’s total score for each quality ranged 
from +6 to — d by any master was between 


I.—PERSONALITY ASSESS) 


6. The average number of forms rate! 
225 
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two and three, although some rated as many as four. The ratings were carried 
out in the second half of the academic year, between January and June, so 
that every master would have taught the boys he was assessing for not less than 
five months. 

In 1954-5 the experiment was repeated : 731 boys were rated by thirty-two 
different masters, between February and June, 1955. 

Class-mates' ratings were collected at the end of each academic year. The 
Writer visited each form in turn and after handing out blank sheets of paper and 
warning boys not to write their name on their paper, gave the following instruc- 


“ Have you ever tried to judge the character of the other boys of you 
form? Here is how to do it. I am going to read 


on the paper. This has nothing to d 


. You will find that the descriptions are insets of four. The first descrip- 
tion in each set is of a boy who possesses a great deal of the quality described ; 


the second boy possesses a fair amount ; the third t got t lity at all 
and the fourth has very little. 7 ie tied has not got the quali itn 


“ Listen to the description of the first b i it hi d 
Details of the character-sk we i ba dr) ing ye 


etches are given in the Appendix, where it will 
be observed that they are based fairly closely on the rating-scales used by the 
teachers. A score of +2, 41, _9 or —l was awarded to each boy, each time 
he was mentioned in one of the four 


algebraic sum of these Scores, divided by th i f 
boys present inthe ne inl Y the number of raters, 1e., the number o 


In July, 1954, 678 
1955. 


Personality traits Studied: dominance, 
: re ‘Toversion, nervousness, persistence, sociability, 
emotional stability, and io In the 1955 version, ten questions on 
: Were in, i ranged 
cyclically and in alphabetical ord aged, The questions Ar Ar DEA 


far he thought each statement aie ee oe Tha boy dicated boi 


AB, C D, orf Was give Q4, 1 every Subject which was examined, a grade of 
Proportions of A 10 je oe E rades being awarded to the year-group in the 
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divided by the number of subjects in which he was examined and converted into 
a percentage. Boys who were absent for terminal examinations were not 
considered, nor were boys transferred from other grammar Schools in the 
course of the year. In the fifth year, the twenty-five or so promoted directly 
from the fourth form to the lower sixth were counted as successful and the group 
of thirty was completed by the five or so with the highest achievement index in 
the fifth year forms. Those who were in a fifth form for a second year were 


excluded from this study. 
The successful group of about 150 boys consisted then of five sub-groups of 
thirty successful boys, i.e., the top 25 per cent. in each year-group from the first 


year to the fifth year ; similarly, the unsuccessful group comprised the 150 boys 
falling in the bottom 25 per cent. of their respective year group. In com- 
parisons of the mean score for personality assessments of successful and 
unsuccessful boys, the *t'-test of significance was applied; the level of 


significance is indicated in the tables of results. 
TABLE 1 
PERSISTENCE.—MEAN SCORES FOR SUCCESSFUL AND UNSUCCESSFUL GROUPS. 


Successful Unsuccessful 


N M 


Class-mates’ ratings 
(+2 to —2): 
1953-4 
1954-5 


Questionnaire 
(+20 to —20) : 
1953-4 


r persistence ratings given by both 
mates at all age-levels. With only two exceptions, for class-- 
mates' ratings in 1954-5, these differences were significant at the 1 per cent. 
level; the two exceptions were significant at the 5 per cent. level. 

In the questionnaire scores, à significant difference in favour ofthe success- 
ful sub-group appeared only once, at the 1 per cent. level, for the fifth year sub- 
group in 1953-4. In no case did an unsuccessful sub-group achieve a significantly 
higher score than the corresponding successful sub-group. 

hree different assessments made of individual boys 


Significant differences Were found fo! 


teachers and class 


ence in the questionnaire, wherea: r 
by three of their teachers and thirty of their class-mates. It may be that such 


boys set very high standards for themselves. 
On the other hand, many unsuccessful boys gave themselves very high 


scores for persistence, whereas teachers and class-mates gave them low assess- 
ments. Such boys may be either deliberately distorting their score or may have 


very low standards of persistence. 
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TABLE 2 


INDEPENDENCE.—MEAN SCORES FOR SUCCESSFUL AND UNSUCCESSFUL GROUPS. 


Successful Unsuccessful 


N M M 


Teachers’ ratings : 


41:67 —0-80 
4149 —0-68 


+0-18 —0-23 
+0-14 —0-20 


+3-85 
+3-95 


For teachers’ ratings, significant differences at the 1 
for all sub-groups, both in 1953-4 
1953-4 and 1954-5, the differenc 
reach the 5 per cent. level of sign 

In the questionnaire scores, 
in 1953-4, showed significant diff 
of the questionnaire scores for p 
a significantly higher score than 


per cent. Jevel appeared 
and 1954-5. For class-mates' ratings, in both 
es for the fifth year sub-groups just failed to 
ificance ; all other differences were significant. 
only two sub-groups, the third and fifth years 
erences, at the 5 per cent. level. As in the case 
ersistence, no unsuccessful sub-group achieved 
the corresponding successful sub-group. 


TABLE 3 
INTEREST.—MEAN Scores FOR SUCCESSFUL AND UNSUCCESSFUL GROUPS. 


Successful Unsuccessful 


N M N M 


Teachers’ ratings : 
1953-4 .. 151 4 149 =f 

149 . 149 = ia ks 

Class-mates’ ratings : 
1953-4 v 
1954-5 


Y" 151 


149 —0:5 
149 


Questionnaire : 
Used in 1954-5 only 


est in the questionnaire scores. For the first and 
second year sub-groups, the differences were not significant, but there was 4 
rise : Ween the means at the third-year level. The 
highly significant differences, at the | per cent. level, in the third, fourth and 
fifth year sub-groups were brought about by a decline in the scores of the 
the successful remained steady in all sub- 


^ 
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groups. A possible explanation js that consciousness of failure may cause a 


significant change in attitude to school. 
For persistence, independence and interest, then, significant differences 


between scores of successful and unsuccessful groups appear at all age levels, 
for at least two different types of assessment, 1n both years that the experiment 


was conducted. - : P 
The results for other qualities may be summarised as follows : 


(a) No consistent relationship was found between assessments of domin- 
ance and academic performance. There was no evidence to suggest that 
unsuccessful groups sou! tion in dominant behaviour for their lack 


ght compensa’ 
of scholastic success. 

b) There was no evidence that unsuccessul groups Were considered, or 
considered themselves to be more nervous. In fact, the tendency was in the 
other direction : teachers regarded many successful sub-groups as more 
nervous, although there was no support for this view from class-mates or from 
the questionnaire. g 

idered some successful sub-groups to be more emotion- 


c) Teachers consi r s to 
ally stable, put there was practically no support for this view from the other 


sources of assessment. 
(d) There was a tendency for older successful sub-groups to regard them- 
selves as less extraverted than the corresponding unsuccessful sub-groups. 

e) Some successful sub-groups considered themselves to be less sociable 


than their unsuccessful fellows, but this view was not supported by class-mates' 
ratings. 


SMENTS AND G.C.E. PERFORMANCE OF GROUPS MATCHED 


[J], PERSONALITY ASSES 
FOR 1.Q. 


In this investigation, the methods of personality assessment previously 
ut this time the comparison was made between the mean 
scores of contrasting groups of fifth-formers, each group consisting of just over 
1.Q., but judged successful or unsuccessful on their 

performance in the‘ O’ level G.C.E. examination of the N.U.J.M.B. All were 
taking G.C.E. for the first time at the end of a normal five-year course. The 
e those already made in the second half of each 


tched, to within 5 points of I.Q., from the successful 
fi e passes) group and the unsuccessful (three or less passes) group, 
iue d cir each pair having at least two more passes than the 


unsuccessful one. 
TABLE 4 


ANDIDATES, MATCHED ON LO. 


Data RELATING TO Groups OF Firrn-vear G.C.E. C 


N | Average I.Q. Average No. | Difference 
of passes 

a t 

1954 

Successful. ...-- 84 120-94 Moray House 4-91 TT 

Unsuccessful ..| 4 120-85 f Advanced 7 2-20 eo 

1955 e 

Successful.....- 32 119-25 Moray House 5-53 

Unsuccessful 32 119-47 f Advanced 6 1:78 

j 
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It will be seen that the successful averaged about five passes each, while the 
unsuccessful averaged only two, a very real difference in achievement. 
TABLE 5 


PERSISTENCE.—MEAN SCORES oF SuccressFuL AND Unsuccessrut G.C.E. Groups, 
Matcuep on 1.Q. 


Successful Unsuccessful 


N 


Teachers’ ratings : 


ai . 34 
32 


34 


Persistence ratings are seen to be inde 
significantly between groups of comparabl 
formance. The questionnaire scores show 


pendent of LQ. ; they distinguish 
e I.Q., but of differing G.C.E. per- 
no such tendency. e 
TABLE 6 


INDEPENDENCE.—MEAN Scores or SUCCESSFUL AND UwsuccEssrUL G.C.E. Groups, 
MATCHED on LO. 


Successful Unsuccessful 


N M N M Sig. 


Teachers' ratings : 
1954 
N.S. 
5% 
(Almost 
1%) 


5% 
(Almost 
1%) 

5% 


— 
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TABLE 7 


L "js eee G.C.E. Groups, MATCHED 
ox LQ. 


INTEREST.—MEAN SCORES OF SUCCESSFU: 


Successful Unsuccessful 
N M N M Diff 
Teachers’ ratings : 
1954 IRE TT 34 4-121 34 —0-79 | +2-00 
1955 cca He He 32 41:87 32 —0-87 | +2-74 
Class-mates’ ratings -++ 
1954 5 r 34 4-0-36 34 —0:39 | 40:75 
1958 ee 32 +0-28 32 —0-32 | 40-60 
Questionnaire L k 
Questions on interest not included |in 1954. 
ORG) akan qo tele 31 +542 31 41-29 | +413 


nterest differentiate clearly and consistently between the 


Assessments of ii 


matched groups. , 
As in the main study, 


there were no significant differences for dominance or 
hers considered the successful groups to be more 
the differences being significant at the 1 per cent. level, 
cant differences in class-mates' ratings OT in question- 
naire scores. In 1954, the successful candidates considered themselves to be 
less extraverted and less sociable than the unsuccessful, the differences being 
significant at the 1 per cent. level, but this pattern was not repeated in 1955. 
Summing up, then, for each of the qualities persistence, independence and 
interest, significant differences have been demonstrated between the mean 
scores of at least two different forms of assessment for successful and unsuccess- 
ful groups matched on I. No such consistent pattern appeared for any other 


quality. 
n" question may now be asked: Has any ri 
strated between persistence, oe and interest on 
i and failure on the other : 
academic success re subject to the «palo effect: 


i that ratings of personality a 
It is well known him down as very persistent ” 


E o uccessful in school, so I'l put him 
I know that X is suc obe suere a 


mi ical thought- rocess. 
B S pica ‘tions to Teachers and, in the writer's opinion, the team of 
raters was both highly competent and highly conscientious ; they were interested 
and made many suggestions for improvements in the wording of the rating- 
scales. In addition, it should be noted that, in a large school, with parallel 
forms, individual subject-masters may not have a very Clear idea of which boys 
are the most generally successful in their year-group, SO that ‘ halo ' effect would 
be based on performance in one subject only and, in that way, would be limited 
in its extent. It might have been better to have each boy rated by as many as 
five masters, but there is 2 limit to teachers' time and forbearance. 

In order to minimise ‘halo’ effect in class-mates’ ratings, the character 
sketches read out to the boys were deliberately worded to exclude references to 
school situations as far as possible, although obviously, this was impossible in 


the case of interest. 


emotional stability. Teac 


nervous in both years, , 
but there were no signi 


eal connection been demon- 
the one hand and 
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In the questionnaire, answers may be a deliberate distortion of the truth 
or boys may vary considerably in insight into their own reactions and thought- 
processes and in the amount of conscientiousness with which they answer the 
questions. 

The devices used for personality assessment were, therefore, far from perfect 
but at least there were three of them and the whole experiment, which related 
to boys ranging in age from 11+ to 16+ was repeated in two successive years. 
For persistence, independence and interest, the three different forms of assess- 
ment, which were in general agreement, seem to point to a coherent and 
consistent pattern of relationship with academic success. 

A further check is, of course, to consider the accuracy of the predictions 
of academic performance which can be made from these personality ratings and 


to compare the predictions given with those derived from other sources. This 
topic will now be considered. 


IIL— PERSONALITY ASSESSMENTS AND THE PREDICTION OF ACADEMIC SUCCESS. 


_ Of the three devices used for assessment of persistence, independence and 
interest, only teachers' ratings are here considered as predictors, since it would 
be impossible to make any practical use of either class-mates' ratings or 
questionnaire scores in primary schools, whereas it has been suggested that 
primary school reachers' ratings of personality should be used as an element in 
selection at 11+, and Universities commonly ask for such ratings, in the case 
of sixth form pupils applying for admission. It seems reasonable, therefore, to 
see what relationship exists between teachers’ ratings of personality and the 
pupils' academic performance. 

As an estimate of the reliability of grammar school teachers' ratings, 
correlations. were calculated between the combined assessments given by 
teachers for persistence, independence and interest made in 1953-4 with those 
given in 1954-5 for the four year-groups of about 120 boys each, who were 
assessed on both occasions. These assessments range from +18 to —18. 


oc correlations for the two sets of assessments are presented in 
le 8: 


TABLE 8 


CORRELATIONS BETWEEN TEACHERS’ Ratines, 1953-4 AND 1954-5. 


r 
1953-4 Ist Year ratings and 1954-5 2 d Y. | 506. 
1953-4 2nd Year ratings and 1954-5 Sea V. 


ar ratings ...... -506 


5 Y i Year ratings 
1953.4 3rd Y : = E Year rati ee tn :657 
rd Year ratings and 1954-5 4th Year rati : e 


1953-4 4th Year ratings and 1954-5 5th Year ratings 1720 
Average r (Fisher's ' z ' tech “665 


There is a reasonable consi: 

; stency, therefore, in the teachers' rati The 

average correlation c TORE y , he teachers’ ratings. 
ality e tings: oefficient is about that often reported in studies of person- 
achievement for the a the correlations between these ratings and academic 
o PERS ane. fe m the ratings were made. For first, 
‘as ‘ : T boys, the criterion of academic performance 
was the ' achievement index already described; for fifth vu bonu, only 
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G.C.E. candidates taking the examination f i 

. Car 2 or the first i 

the criterion was the number of G.C.E. passes. se He eres aes vus 
correlations are presented : i Sc 


TABLE 9 
CORRELATIONS BETWEEN TracuERs' RATINGS OF PERSONALITY AND ACADEMIC 


ACHIEVEMENT IN THE SAME ACADEMIC YEAR. 


1953-4 1954-5 


lst Year 
2nd Year 
3rd Year . 
4th Year . 
5th Year .. 
Average T 


lst Year 
2nd Year 


3rd Year 
4th Year 
5th Year 
Average T 


ratings of personality reach a consistent and 
fficiency in predicting performance in internal 
and, at the fifth year level, in an external examination. 

er the forecasting efficiency of the 1953-4 ratings in 
i the same criteria of academic perform- 


Grammar school teachers' 
fairly high level of forecasting € 


examinations 
Let us now consid: à 
predicting achievement in 1954-5, using 


ance: 
TABLE 10 


Gs OF PERSONALITY, 1953-4, AND ACADEMIC 


OLLOWING YEAR, 1954-5. 


VEEN TEACHERS’ RATIN 
ACHIEVEMENT IN THE F 


CORRELATIONS BETV 


SS 


sand 1954-5 2nd Year achievement 118 


1953-4 Ist Year teachers’ rating: 
3 sand 1954-5 3rd Year achievement 120 


1953-4 2nd Year teachers' rating: i 

1953-4 3rd Year teachers' ratings and 195 -5 4th Year achievement 121 

1953-4 4th Year teachers" ratings and 1954-5 5th Year G.C.E. result 87 324 
Averager .- -495 


There is a decline in the forecasting efficiency of the ratings, a decline 
particularly marked as far as G.C.E. performance 15 concerned. The fourth 
year ratings show & correlation of -324 with G.C.E. performance, whereas the 
fifth year ratings give à correlation of .592. Experience suggests that, under the 
influence of the imminent examination, many G.C.E. candidates will begin to 
show increased persistence and a more favourable attitude to school work 
i fth year. 
during the A see what happens when we attempt something much more 


difficult—to predict G.C.E. performance three years ahead. For the second 
year group © ers' ratings of persistence, independence 


1954-5, we have teachers | d 

t, we i hievement index tor that year and, in addition, 

end is 1.Qs. derived from Moray House ‘Advanced 6 Test, taken in June, 
1955. We,can, therefore, compare achievement index, teachers’ ratings of 
personality and I.Q. as predictors of G.C.E. performance in 1958 : 
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TABLE 11 


CORRELATIONS BETWEEN PREDICTORS, 1955, and G.C.E. PERFORMANCE, 1958. 


Achievement index, 1954-5 


Teachers’ ratings of personality, 1955.. 
I.Q. (Moray House Advanced 6), June, 


Teachers’ ratings of personality are by no means as good a predictor of 
G.C.E. performance as achievement in school examinations, although they are 
better predictors than scores derived from an Intelligence Test. N 

This is very much the same pattern of results as that in an experiment 
previously reported by the present writer (Times Educational Supplement, 
14th Nov., 1958, p. 1649). In that case, for a group of about fifty or sixty boys 
entering the grammar school, ratings of persistence, independence, interest 
and other qualities, made on the scales given in the Appendix, by only one 
primary school teacher before the 11-1 examination, were available. Tables 
showing rank-difference correlations between, on the one hand, performance 
in the 11+ examination, teachers’ ratings of personality and I.Q. as measured 
at the time of the 11+ examination and, on the other hand, the achievement 
index for these boys at the end of each academic year in the grammar school, are 


here repeated and brought up to date : 
TABLE 12 
1954 INTAKE.—CORRELATIONS BETWEEN PREDICTORS AND GRAMMAR SCHOOL 


ACHIEVEMENT. 


Achievement Index 114- 
examination 


Personality 
ratings 


Ist Year G.S., 1955 658 +121 
2nd Year G.S., 1956 574 ` -208 
3rd Year G.S., 1957 607 E :262 
4th Year G.S., 1958 -561 


" :275 
5th Year G.C.E.1959 467 : 303 


TABLE 13 


1955 INTAKE.—CORRELATIONS BETWEEN PREDICTORS AND G.S. ACHIEVEMENT. 


Achievement index 114 


1.0. Personalit: 
examination Q y 


ratings 


Ist Year G.S., 1956 352 i mE 
2nd Vear G.S., 1957 -287 055 272 


:103 3 
3rd Year G.S., 1958 310 106 Ge 
4th Year G.S., 1959 443 E E 
5th Year G.C.E.1960 :321 254 d 


For both intakes, the primar 


Y school teachers' rati f pefsonality are 
better predictors than I.O., but ratings of pef: 


less good than 11+ examination positions. 


r= 
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In the bottom line of the table for 4 i 
the 1955 intake, the enna i ve ee amem vee: ied 
the 195 level, those with teachers' ratings at the 10% level E pie E 
tions with I.Q. are not significant, as is to be expected "with small eue 
selected grammar school groups. In the bottom line of the 1955 table pod 
of the correlations with G.C.E. performance is significant but the o iv 
magnitude is the same as for the 1954 intake and as in table 11 which cem 
pared predictions of G.C.E. performance made three years previously ud 

This is an interesting further piece of evidence, suggesting the importance 
of the role played by persistence, independence and interest in ibarscing 
scholastic success. But on such evidence as this, we would not be justified ed 
replacing the 11-+ examination by primary school teachers’ ratings of their 
pupils’ personalities, although it might be worth while experimenting with 
such ratings as an clement in selection. 

For the moment, and until some less crude and more effective method of 
assessing aspects of personality is devised, the much-decried 11-- examination. 
imperfect as it is, seems as good a predictor as we are likely to have in this 


imperfect world. 
IV.—APPENDIX. 
(1) TEACHERS’ RATING SCALES. 


(a) Persistence. 
+2 Persistent in overcoming difficulties. Extremely steadfast. Never gives 
up. Decidedly strong-willed and persevering. 


+1 Rather persevering. 
0 Average for his age. 
—1 Gives up rather easily. 
_2 Decided lack of will-power and persistence. Easily discouraged and gives 
up at the slightest difficulty. 


(b) Independence. 
dent of others in thought and action. Never led by others. Can 


+2 Indepen 
understand a problem better by studying it alone, rather than by discussing 


it with others. 
4.1 Not usually led by others. Will usually stick to his own point of view. 
0 Neither markedly dependent on others nor independent. 

—] Sometimes led by others. Will usually defer to another's point of view. 
—2 Markedly dependent on others in making decisions. Waits for others to 
give a lead. Usually prefers to be with others. 


(c) Interest. 

+2 Shows keen interest. 

+1 Interest above average. 
0 Average interest. 

—] Interest below average. 


—2 Not interested. Unconcerned. 


(2) CuanacrER-SKETCHES USED FOR CLASS-MATES' RATINGS. 


(a) Persistence. " 
(i) This boy is very good at sticking at a job or a game until it is finished. He 
never wants to give uP until he has finished. He is very strong-willed and persever- 


ing. He is persistent in overcoming difficulties. 
(ii) Next write down the names of the boys who are next best at sticking. at 


something they have started. They don't usually give up until they have finished. 


They are fairly strong-willed and persevering. 
(iii) Now we are looking for the people who are the poorest at sticking at some- 


thing they have started. These boys never settle very long at anything. They are 
easily discouraged and give up at the slightest difficulty. 
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(iv) Now write the names of the boys who are the next poorest ' stickers. 
They give up rather easily and are not very good at sticking at anything they have 
started. p 

b) Independence. . 
4 (i) fis type of boy is very independent. He is never led by others. He tries 
to work out things for himself. He probably pefers to work on his own. He usually 
Sticks to his own point of view, and is not influenced by other people. 

(i) Now write the names of the boys who are almost as independent as the 
ones described in the previous section, but not quite. : 

(ii) This type of boy waits for others to give a lead. He can't make up his 
mind for himself. He seems to be dependent on others. 

(iv) Now write the names of any boys who are almost as dependent as the one 
described in the previous section, but not quite. 

(c) Interest. 


(i) This type of boy is extremely interested in school work. 

(ii) This type of boy is fairly interested in school work. 

(iii This type of boy isn't interested in school work at all. He couldn't care less. 
(iv) This type of boy shows very little interest in school work. 


(3) STATEMENTS USED IN THE QUESTIONNAIRE. 
(a) Persistence. 


7.—1 can work at a hard task for a long time without getting tired of it. 
17.—1 take a pleasure in overcoming difficulties. 


27.—1 can keep on at a tiresome task without someone prodding me. 
37.—1 finish most things which I be in. 

47.—1 can enjoy a lon, 
57.—I stick at a job even thou 
67.—I return to a task which 
77.—1 usually refuse to admit 
87.—I usually refuse to admit defeat. 


97.—1 will go to any lengths rather than be called a quitter. 
(b) Independence. 


3.—I would rather work out my own plans than carry out the programme of 
someone else, even of a person whom I respect. 


13.—I become stubborn and resistant when others try to force me to do things. 
23.—I try to work out things for myself if I am in trouble. 

33.—I argue against people who attempt to assert their authority over me. 
43.—I prefer to do my own planning alone rather than with others. 

53.—I go my own way, regardless of the opinion of others. 

63.—I prefer to work on my own. 

73,—I would rather go without Something than ask a favour. 

83.—If necessary, I could get along quite well without friends. 


93.—I usually say * No ' when others offer to help me. 
(c) Interest. 


4.—1 get on very well at school. 


14.—I think that the subjects we study at school 
24.—The day passes very quickly when I am at E 
34.—I think that boys who can Stay on at school 
44.—I think that what is taught at school is ve 
ay on at school as | 


are very interesting. 
chool. 

in the sixth form are lucky. 
ry important. 


(Manuscript received, October 19th, 1959.) 


PLACEMENT PROBLEMS AMONG ENGINEERING 
APPRENTICES IN PART-TIME TECHNICAL COLLEGE 
COURSES 


Part I.—RANGE OF ABILITY. 


By ETHEL C. VENABLES 
(Nuffield Research Unit, Institute of Education, University of Birmingham.) 


Summary. Three groups of part-time day release students taking first year 
Ordinary National Certificate Classes in engineering were given à verbal and 
non-verbal intelligence test (N> 1000). On the verbal test the mean score was 
only slightly above that for the population as a whole, but on the non-verbal 
test 89 per cent. were at or above the fiftieth percentile with 97 per cent. in the 
top 10 per cent. of the population. Recent attempts to improve the selection 
of Ordinary National Certificate students appear to be unrelated to intelligence 
test scores, as the results obtained in two different areas between 1950 and 1957 
show no significant differences. Fifty per cent. had attended secondary modern 
schools as against 73 per cent. of the whole school population in the same age 
range. Seventy-five per cent. had fathers in skilled and managerial jobs, com- 
ared with a national average of 53 percent. In 1957, only 18 per cent. of those 
not in full-time education were attending part-time day classes. 
The educational and sociological implications of these figures are discussed 
in relation to manpower problems. An increase in the number of such students 
must increase the proportion from secondary modern schools and unskilled 
homes, so that the problems associated with teaching young people of poor 


verbal ability are unlikely to diminish. 


J.— INTRODUCTION. 


Tuis paper is the first of three which will summarise the results of a study of 
part-time day release engineering students in National Certificate and Trade 


courses in technical colleges. E : 
Tt deals first with the range of ability to be found in these classes. It has 


been suggested that it is here that any reserve of ability is to be found. Next 


it deals with the differences between the verbal and non-verbal abilities of these 


students. 

Until recently, students in technical colleges have more or Jess escaped the 

attention of educational psychologists. Originally, the local ‘ tech’ embodied 
d and a second chance for all 


the idea of opportunity for the under-privilegec ‘ ; 
comers, so that each student was allowed to decide for himself which course or 
examination he would take. The growth of industry and the increasing need for 
education has led to à great change within the technical colleges, from a situa- 
tion in which only highly motivated individuals presented themselves as poten- 
tial students, to one in which part-time attendance at the local technical college 
is frequently a requirement of the industrial job. The attitudes of such 
students range all the way from that of the keen volunteer of the last century to 
the indifference of the * practical' man who rejects completely any intellectual 
approach to the work he has to do. Between these two extremes the young 

eople concerned are likely to be influenced in their attitudes to college studies 
not only by teachers and classroom companions, but by foremen and managers 


who may or may not regard academic certificates as a necessary adjunct to 
practical skill. 
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i le 
In 1952, two years after this research began,13 per cent. of young peop 
aged 15-17 mam who were not undergoing full-time education, me 
technical college classes on one day per week. Engineering industries supp! “A 
by far the highest proportion of these students (34 per cent.) and the results 
reported here refer to students in engineering departments only. BM 
departments there are two main types of courses—National Certificate and a 
variety of trade courses leading to Certificates of the City and Guilds of London 
Institute. In recent years the need for a third stream!? for those of lower 
academic ability has been recognised and many colleges now run three types 2 
classes within one department. In an educational system with a fairly rigi 
selection for grammar school and university, the policy of an ‘ open door’ in 
further education is too valuable to lose, but the problems of ‘ placement ' in 
appropriate courses are becoming increasingly difficult. 


IL—MzTHOD or ENQUIRY. 


Batteries of tests were given to first-year engineering students in National 
Certificate and Trade courses in 1950 a 


nd 1952 in four colleges in the North- 
West and in 1957 in three colleges in the Midlands. The batteries varied slightly 
on each occasion but all students were given the N.I.I.P. Group Test 33 and 
Raven's Matrices (1947 version). (The 1938 version proved too easy and did 
not discriminate between the two groups of students.) 


IIL—Rzsurrs. 


Test scores obtained in 1950 and 1952 in the North-West and in 1957 in 


the Midlands, showed no significant differences. 


Twenty-seven per cent. of students in the first year of an Ordinary National 


Certificate engineeering course were in the top 30 per cent. of the population 
on the verbal test of abilit 


y. On the non-verbal test 77 per cent. had scores at 
or above the 70th percentile (Table 1). 


TABLE 1 
PERCENTAGES OF PART-TIME Day RELEASE ENGINEERING STUDENTS SCORING WITHIN THE 
SAME RANGE AS THE Top 


HALF OF THE POPULATION on Two DIFFERENT TYPES OF 
INTELLIGENCE TEST. 


N>1,000 
Part-ti engineering students 
General population acl d 


Raven's Matrices 
1947 N.LLP. G.T.33 


Top 10% 37% 7% 


Top 20% 60% 16% 


Top 30 % 


77% 27% 


, but accordi ly 40 per cent- 
were comparable when judged by the dob rr pod 
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TABLE 2 


LLP. G.T.33 or FIRST-YEAR PART-TIME ENGINEERING STUDENTS COMPARED 


ScorEs oN N 
WITH UNIVERSITY STUDENTS. 


First-year O.N.C. First-year 
Trade 


Scores *University 
Students 1950 1952 5 1950/1952 
combined 


185 and over ..... 


184-165 2... eee eee eres 


164-156 ..... eee eee 


Ehari 
155-145... ee eee eee ee 


144-125 ..... m 


124-105 ..... e te 


Below 105 ..... e 


Within top 75% of Uni 
Group (approx.) 


on Department.!^ 


*. Graduates in Arts and Science in a University Educati 


TABLE 3 


Matrices 1947 OF FIRST-YEA: 
COMPARED WITH UNIVERSITY 


R PART-TIME ENGINEERING STUDENTS 


SconEs oN RAVEN'S 
STUDENTS. 


First-year] 


First-Year O.N.C. 
Trade 


Ši *University = 
Scores Students 1950 1952 1957 1950/1952 
combined. 


1 


4 


Within top 75% of Univ. 
Group (approx-) 


were third-year medical students and the rest graduates in 


* About half this group 
Arts and Science. 
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IV.—Discussion. 


During the years covered by this research there has been much discussion 
about wastage and failure on these part-time courses. Colleges have been 
attempting in various ways to improve their methods of placing students in the 
two or three types of courses available; the more academic National Certificate 
courses on the one hand and the less demanding Trade courses and practical 
' Third Stream ’ courses on the other. 

Intelligence test results obtained in the Midlands in 1957 and shown in 
Tables 2 and 3 are not significantly different from those achieved by students in 
the North-West in 1950 and 1952, suggesting that the methods of selection in 
use are unrelated to intelligence test scores and still fail to exclude from the more 


difficult course those of poor ability who are better suited to trade or Third 
stream Classes.* 


students on the Matrices is conside 
tendency for Arts stud 
Science faculty is well established. The test scores in the first column in Tables 
2 and 3 are for Arts 

exaggerate the differences: it wi 


Science and Technology graduates only, but the full details are not available. 
Tozer and Larwood!? found t 


s ifferent Way, are they those whose verbal ability 
(or lack of it) is such that the grammar « hool Yini i improve 
(or remedy) it? Table 4 dom ‘ School training has failed to imp’ 


d cap 
Advanced level. In the repo 


* This subject is di i ii Fee 3 a 
PE lo j iscussed in detail in the writer's (unpublished) Ph.D. Thesis, Man 


—O—-—— — — MM MM mÀ— MÀ 1 
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TABLE 4 


Scores ow Two TESTS ACCORDING TO PnEvrous SCHOOLING. 


Raven's Matrices 1947 N.LLP. G.T.33 


Percentiles â; 
Grammar 


Grammar 
Technical Modern 


and Private | Technical Modern and Private 


N N | % of 
group 


3% 
16% | 


nN | % of | N | % of | N 
group group 


46% | 37 


pne 39% | 30 


SEE 6| 8% 


Bottom 50 795 


46% 


35% 57% |+ 


Totals ....| 74 | 100% 10095 


are taken, the council also reached tbe rather surprising conclusion that it 


seemed likely that " less than half of the present intake into grammar schools 
could profitably take sixth form work." Even allowing for a large margin of 
error in grammar school selection, this still leaves a Very considerable number of 
adolescents needing further education if their abilities are not to be wasted. 
It is also worth bearing in mind that the parents of a small number of ten-year- 
olds refuse to allow their children to compete for a grammar school place, so 

n school pupils may have high verbal ability. Over 


that a few secondary moder 
s tested had scores placing them in the top quartile on 


a quarter of the student 
in the bottom half of the population on the verbal test. 


the non-verbal test, and in 
fference innate and how far environmental ? Parnell’? 
sical build, engineers can more often be classified as 
also that there is some relationship between 


an arts men, and 
mesomorphy an ili Non-verbal tests, although heavily 
saturated with ' : or for space perception. 
Parnell's results E andi as yet, d, but it is possible that specific 
abilities (rather than gener ility) tend to be linked with physical make-up 
‘Again, physic al make-up May play so direction of interests and 
th in the choice of a job. 1 
il A ronmental differences could arise in at least two ways. Lack of verbal 
stimulation in the early years may ead to a fundamental difference in methods 
of thinking and response which would be difficult, if not impossible, to reverse. 
ion* and on social class and educational 


ght be interpreted in this way. 
e differences could arise through a lack of interest and 


On the other hand, th es 
attention. Differences 1n home conditions may affect early responses to school- 
i t under-achievement. 


i i eject ic values and consequen 
ie us toy extent true, then, when such a youth enrols in the local technical 
e, it is not only he who is being given a second chance, but the educational 
Such students present the teacher—not only of English, but 
iects—with a challenge, which will have to be met if the 


bje 
also of the act o d technological man-poWer is to be solved. 
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Vernon, including in his list of ‘Human Abilities’ factor X (which 1S 
variously called ‘ studiousness,’ ‘ interest,’ or ‘ industriousness ’), says Little 
is known about X and further research would be profitable. In particular, we 
would like to know how far it depends on (a) home background, (b) the tone of 
the pupils's school, (c) the stimulatingness of, or good teaching by, his teacher, 
(d) the pupil's interests, (e) his temperamental characteristics”! Vernon has 
himself produced evidence that the intellect can be ‘ spurred "? though how 
far his results are due to (a), (b) or (c) it is not possible to say. Lovell® in an 
interesting paper in this field has listed the characteristics of a stimulating school, 
indicating how (a), (b) and (c) are related and impossible to disentangle. He 
cites the work of Hearnshaw? on ‘ Concept Formation’ and concludes that 
further research is needed into the problem of whether a “ relative inability 
to categorize is due to poor quality education and limited cultural background 
or whether it reflects a change in the response of an individual to the environ- 
ment." 

Vernon, summarising his own and Lovell's results! concludes that “ Certain 
aspects of intelligence, in particular, flexibility of thinking, are as much or 
more, affected by the stimulation that different environments and different 
types of schools provide as are the conventional attainments. Clearly, far more 
research is needed into the nature of these abilities and into the types of educa- 
tion that best helps to develop them." 

The profound effect of the change from school to work is a matter of com- 
mon observation. That firms vary in their effect on their apprentices has been 
shown in the recent report of the N.LI.P. of a validation study of their tests for 
engineering apprentices and there can be no doubt that colleges and their 
stafís frequently have a beneficial influence. Most students attend these courses 
between the ages of 16 and 21—a time when there can be a great, and often 
unexpected, development, and when their interests are continually changing. 
Certainly one of the most rewarding aspects of work in a technical institution 
is to watch an uncouth, rowdy or shy schoolboy change into a self-respecting 
and skilled citizen. 

Research is in progress in three colleges to discover whether there are any 
changes in test score during the years of study, and if so, to what extent they are 
related to examination success. 

. Viewed nationally the numbers of young people involved in further educa- 
tisn classes is very large (230,000 in 1957/58).° It may be asked what conse- 


quences are likely to follow the increase of day-release students which is en- 


visaged.! In the sample tested only half were from secondary modern schools 
thougb at that time they catered for 73 per cent. of secondary school children, 
and only 9 per cent. had fathers in semi and unskilled jobs, whereas 29 per cent. 
of all Workers can be so classified.? Hence, assuming the sample to be representa- 
tive, it seems likely that the increase in students will mean an increased propor- 
tion from secondary modern schools and from semi-skilled and unskilled families- 
i it seems likely that the proportion with low verbal ability will also 
se. 
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BEHAVIOUR IN CLASS GROUPS OF CONTRASTING 
CLIMATE 


By D. V. CONNOR 
(Lecturer in Education, University of Otago, Dunedin.) 


Summary. In this study classroom groups of varying ' climate ’ were identified 


by means of a questionnaire. A comparison of the behaviour of the groups of 
‘contrasting climate’ was made in terms of social interaction, OP ie 
rapport, social structure, emotional adjustment and attitude to € A 
analysis of the results shows : (1) that the class as a social unit seems to j a $* 
more relevance to the kind of ' climate ' found than does the school as a who e 
and (2) that the quality of the behaviour in those classes which are identified, 


on the basis of the original questionnaire, as having ' good climates,’ varied 
widely from those having ‘ poor climates.’ 


I.—Intropuction. 
Tris now widely agreed that a class must be viewed not only as a group formaly 
set up for learning or instruction, but also as one where a very complex set : 
inter-personal relationships is operating, which must be taken into s gm, 
These dynamic relationships between Pupil and teacher and pupil and pupi 
make for good or poor classroom adjustment and are often referred to as the 
‘ climate ’ of a classroom. In viewing a classroom group in this way a ieschel 
ork, he must go beyond his materials m 
e has at his disposal forces that are very apie 
8 the learning process and for achieving the goals 
which are originally set. 


The important basic elements in any particular classroom climate are me 
home and school settings and the personalities of the participants as individuals. 


However, it is the unique conditions which are generated in the teacher-pupil- 
pupil interactions that constitute th 


classroom climate proper and it is to these 
conditions that the present study is directed. 


IL—PnEvous RESEARCH. 


es of classroom climat. 


nd b he teacher set the 
Lippitt and White (1943) 


Rees , and Anderson and Brewer (1946) showed behavioural 
distinctions between grou teachers and group leaders 


Vo 7 Ings to a theory of instruction or learning in terms i 
pupils goal setting and achieving under different teacher influences. On 
wider frame of reference Kemp 


(1955) has shown that the characteristics of & 
school, and Campbell (1952) that the cultural level of the home also have an 
important effect on adjustment and attainment at school. 
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]II.—PLAN OF THE INVESTIGATION. 


The present study is limited to an examination of the relative in: 

the school and of the class group itself on classroom adjustment Eo, — 
the differing behaviour of groups of contrasting ‘climate.’ It postulates that 
wide variations in ‘climate’ can be objectively demonstrated, that these 
differences are associated with influences inherent in the class rather than in the 
school as a whole, and that the quality of the teacher-pupil behaviour in such 
contrasting classes is consistent with the particular climate revealed. 

(1) Subjects and Materials. In order to locate groups of contrasting 
climate, Flanders’s* “ Adjustment to School " questionnaire was administered 
to all the schools containing Standard TV classes in an urban area. This involved 
681 children from eighteen classes in eight different primary schools which 
sampled all socio-economic levels in a metropolitan area. Flanders's question- 
naire consists of seventy-eight questions of which the following are a representa- 


tive sample : 
he teacher leaves the room? 


1.—Does the class ‘ play up ' when t 
Mes. edis Nos. 
2.—Are you so keen on your work that you always try your best ? 
Yes. 


3.—Have you ever been strapped by a teacher ? 
4,—Are you afraid when you don’t finish your work ? 
5.—Does your teacher have lots of fun with you ? 

6.— Would you work just as hard if there were no tests ? 
7.—Does the teacher make you feel foolish ? ace 
8.—Are you sometimes not sure what to do next in class ? b 

Preliminary work with the questionnaire by Flanders showed that it 
differentiated between well and poorly adjusted groups as judged by Inspectors' 
ratings, and between classes with authoritarian and permissive teachers as 
judged by the frequency and quality of verbal output of these teachers. 

(2) Procedure. Using this questionnaire as a criterion, six ' good ' and 
six 'poor' groups were selected from the eighteen groups and intensively 
studied as follows : b , ac^ A 

(a) Observation Period, during which a study of ‘social interaction ’ 
among the children was made. Each child was observed in turn for a fixed 
period of time and account was taken of the number of ' social contacts ' which 
that child made with other children and with the teacher. —. ; 

(b) Teacher-Pupil Rapport : During a second observation period, Wright- 
stone’s (1951) “ Teacher-Pupil Rapport "' Scale was applied. This is a rating of 
the quality of the interaction between the children and the teacher in terms of 
interest, enjoyment, emotional rapport, physical tension, social relations and 


reactions to orders. j - 
A sociometric test containing questions such as 


(c) Social Structure : i 
d “ Whom would you like to take 


“ Whom would you like to sit next to? ” an 
home to tea? ” was administered to each of the twelve groups. This permitted 


an analysis of the ' group cohesiveness ' of each class. 
(d) Peer Ratings, or “ Guess Who ” test where an assessment is made of 


the ' reputation ' of each member of the class for certain emotional adjustments 
as viewed by every other member. This consisted of small ‘ pen pictures’ of 

upils, and the class was invited to identify one from their own group by means 
of this thumb-nail sketch. In this way, rankings on such personality traits as 


* As yet unpublished. 
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perseverance, suggestibility, popularity, etc., were combined in a composite 
score for ‘ emotional adjustment.’ 

(e) Attitude to School : Fitt's (1956) Scale was given to assess attitude to 
school in general rather than toward the particular class. This is the usual 
Thurstone-type attitude scale of weighted items from which a total score for 
attitude to school was obtained. 

Where possible, the reliability of these techniques, with the exception of 
the sociometric test, was checked in the present study, or evidence of their 
reliability with similar age groups was available elsewhere. The reliability 
coefficients from either of these sources are as follows : Wrightstone’s “ Teacher- 
Pupil Rapport ” Scale -92; Peer Ratings -83; Social Interaction count -61 ; 
and Fitt's “ Attitude to School” Scale -70 (split half uncorrected). It was 
considered that these figures were satisfactory since the study was dealing with 
groups rather than with individual pupils. 

From the above data it was possible to compare the behaviour of the con- 
trasting groups. It was also possible to examine separately both the influence 


of the class as a group and of the school as a whole on the scores obtained 
by the children on the “ Ajdustment to School ” questionnaire. 


IV.—STATEMENT OF RESULTS. 


Using the total scores on Flanders’s questionnaire as the criterion measure 
of classroom climate or adjustment, an analysis of variance was carried out to 
see whether the influence of different schools or classes was contributing to the 
widely varying scores of individual pupils. The results are shown in Table !. 


TABLE 1 


ANALYSIS OF VARIANCE Or “ ADJUSTMENT TO SCHOOL.” 


Source of Variation Sum of 


Degrees of | M F 
Süi&tes grees of | Mean Square 


Freedom 


Between Schools 2412-72 


Between Classes in the same 7 S587 in 


; signif.) 
School 11542-16 1154-22 00:97 


u (signif.) 
40352-08 60-86 


54308-96 


"3 ome within schools ” ue has been sub-divided y 
1 and “ within classes." Comparing the 
mean squares for the different sources of variance it can be seen that the schools 


such matters as school tone an 
etc., was not carried out. 

It was now clear that it was the c] : "— 

i à ass as a unit iti thin the 

class which were largely responsible for the ed E aie 

room climate. An analysis of these differences between the deum d poorest 


| 
| 
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six classes is shown in Table 2. The ‘ sign test ' has been applied to the scores 
of the contrasting groups and the significance of the difference is shown in each 


case. 
By comparing the contrasting classes in Table 2 a general picture of their 


differing behaviour can be gained : 
(a) Social Interaction. It can be seen that the classes with ‘ good ’ climates 


have a significantly higher number of social contacts. This means that the 


TABLE 2 
ANALYSIS OF BEHAVIOUR OF THE CONTRASTING GROUPS. 


Behaviour Observed or Contrasting Groups 


Assessed. 


Good Climate Poor Climate 


Class | Mean No. of || Class | Mean No. of 
Contacts Contacts 


1.— Social Interaction .....- 


thee 


Total Score 


2,—Teacher-Pupil Rapport 2 
19 
22 
16 
14 


Rejects| Stars Rejects| Stars 


3.—Group Cohesiveness — .. I 
11 
9 
10 
14 


S.D. S.D. 


Aaa s 8-92 11:26 
4.—Peers' Ratings .. -+ 10-64 10-84 
10-56 9-12 
8-84 9-32 
110- y 11-48 
11: 9-00 


Mean Mean Score 


5.—Attitude to School ....| 4 43- 58-5 

16- 7 61:3 
51:5 73-1 P< -001 
58-8 E 64-0 
534 7 66-3 
62-4 64-9 


* See Siegel, S. Non-Parametric Statistics. McGraw Hill, 1956, pp- 68-75. 
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children interact socially more or express themselves more freely than in the 
classes with ‘ poor ’ climates during the formal work of the classroom. 

(b) Relations Between Pupils and Teachers. It is very clear that teacher- 
pupil rapport is better in the well-adjusted than in the poorly-adjusted groups. 
That is, according to the ratings, the pupils and teacher interact more; the 
quality of the interaction is warm and friendly rather than aggressive or 
competitive; there is more interest and enjoyment; the pupil's role is clear ; 
there is no confusion; children are relaxed rather than tense, confiding and 
friendly rather than strained and fearful. 

(c) Social Structure of the Groups. If we define ‘stars’ as those who 
received more than 10:pi cent. of the total number of possible choices, and 
' rejects ' as those who were mentioned as being liked least of all, an examina- 
tion of the results shows that there are significantly more ‘stars’ and ' rejects’ 
in the poorly-adjusted classes than in the well-adjusted classes. This means that 
the former have more people at either extreme of ' star ' and ' reject ' and in 
this sense their group structure is more ‘ fragmented ' or less ‘ cohesive’ than 
the well-adjusted classes. 

(d) Emotional Adjustment of the Groups. Since, by the nature of the rating 
technique used, a distinction could not be made between the contrasting classes 
on the basis of mean scores which were zero in all cases, the scatter of the scores 
wasexamined. It can be seen that there is no clear-cut difference between the 
groups in emotional adjustment as measured by this technique. There was also 
no significant relationship between the scores of individual children on the 

Adjustment to School " questionnaire and on this kind of rating on emotional 
maladjustment. It seems that a child's ‘ reputation ' for emotional adjustment 
as seen by his classmates is something rather different from his ' school adjust- 
ment" as measured by his own assessment of himself on a personal questionniare. 

(e) Attitude to School. It can be seen from Table 2 that the mean scores 
for the ‘ good : classes are significantly lower* than those for the ‘ poor’ groups, 
or that the attitude to school of the former is much better than that of the latter. 


In other words, good classroom climate seems also to be associated with good 
attitude to school in general and vice versa. 


V.—DISCUSSION AND CONCLUSIONS. 
The results indicate that it 


[ is possible by means of the questionnaire 
described above, to identify classe: F n | 


h e S varying from ‘ very good’ to ‘ very poor’ 
in classroom climate, and that the characteristics of these classes differ widely. 
In almost every case desirable behaviour is shown in the classes where the 

climate’ is good. Children of classes with ‘ good ’ climates interact more with 


one another and with the teacher, have better emotional rapport with the teacher, 


are socially more cohesive as a group, and 1 i ol as a 
while, Od theuther han, children calm wb s cede to schon fai 


of classes with ' poor ' climat more rig! 
and formal, are more distant from the teacher, Mu EIS «sepia asa saral 
group and have a poor attitude towards school in general. 

There are indications also from this study that the type of classroom climate 
produced is not so much dependent upon the surroundings or the setting of à 
school but upon what goes on within the class. This particular result, however . 
may be due to the fact that the original questionnaire unduly reflected the 
influence of the teacher and did not sample wider sociological influences. There 
is little doubt, though, that in general the school surroundings and the home 
backgrounds of the pupils, as well as individual personal attributes of pupils an 


* High scores on this scale indicate poor attitude to school. 
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teacher, have their influence, but it is the interaction of these in a dynamic 
classroom situation that has more immediate relevance for the teacher and the 
learning process. 

This study does not offer any evidence on the best way to achieve a good 


classroom climate. It may well be that a good teacher can create a favourable 
climate directly by the influence of his personality, and his group should then 
have all the attributes of a well adjusted class as mentioned above. On the 
other hand, another teacher may achieve the same end indirectly by concentrat- 
ing on single aspects of the process and eventually bring about a good classroom 
climate. Whatever procedure is characteristically adapted, this study has 


shown that important end products are good rapport between teacher and 
pupils, a reasonably permissive atmosphere, a ' cially well-balanced ’ class- 
room, and a healthy attitude to school in general. 1i the attention of teachers 
can be drawn to such findings, they may use them as criteria for judging their 


own classrooms and so be aware of the need or otherwise for evaluating their 


own position. 


GMENTS.—! wish to thank the Director of the New Zealand 
Council for Educational Research, with whose help the project was carried out, 
for his assistance in planning the work. I should also like to thank the Otago 
Education Board for the facilities offered and the headmasters and teachers who 


so willingly co-operated. 


ACKNOWLED! 


VI.—REFERENCES. 


1 Axprnsox, H. H., and BREWER, H. M. (1946). Studies of te: 
ities. Applied Psychol. Monographs, No. 11, 1-156. : 
The influence of home environment on the educational progress 
children. Brit. J. Educ. Psychol., 22, 89-100. 
udy of children's attitude to school in Auckland, 
25-30 


achers' classroom personal- 


of selectiv y sch 
3 Frrr, A. B. (1956). An experimenta 
1 Brit. J. 


mental learning 


situations. . 
> ainment 


s emp, L. C. D. (1955). i 
in primary schools. it. J. wel 
s Lippitt, Re and WHITE, L. K. (1943). The 
Barker, R. G., et al. Child Behavior and D: 
7 WRIGHTSTONE, J+ w. (1951). Measuring the socia 


44, 341-351. 


5, 67-77. 
~jal climate of children's groups. in 
»elopment. McGraw Hill, pp- 


1] climate of a classroom. J- Educ. Res., 


(Manuscript received, Nov. 10th, 1959.) 


SYMPOSIUM: THE DEVELOPMENT OF MORAL VALUES 
IN CHILDREN 


VIII.—Freup’s THEORY OF MORAL DEVELOPMENT IN RELATION 
TO THAT OF PIAGET 


Bv R. S. PETERS 


(Departments of Psychology and Philosophy, Birkbeck College, University of 
London.) 


Summary. The purpose of this article is to develop some conceptual distinc- 
tions in order to explore the relationship between Piaget's and Freud’s theories 
of moral development and to Suggest questions to which both theories give rise. 
Piaget, it is argued, did little more than show that the distinction between 
customary and rational morality actually has application, Freud's theory 
point of development but his actual 
chanisms by means of which customary 
generation and with (ii) the reasons 

why people develop an exaggerated and distorted Style of rule-following and 


n A , have a positive theory about the 
conditions under which the development towards rational morality tends to 
take place, except for Suggestions in Freudian theory about necessary conditions 
such as a proper love relationship in the early years. Indeed, there is general 
ignorance about the results of different methods of passing on rules, methods 
which need to be carefully distinguished from each other. i 

It is suggested, finally, that the development in Freudian theory of the 


theory of the ‘ autonomous ego ’ might do much to unify Freud's and Piaget's 
theories and to Suggest fruitful lines of investigation, 


den ts, suck I d M 
Character I, nvestigation Inquiry! without aequa cone Hartshorne an ES 


A the percepts less blind. T ropose to 
attempt this by settin out Piaget’ i et i 
Freud's theory in relation to it. Bete theory very briefly and by dealing with 


he distinction between what might be 
lowing of a rational moral code. By 


A Ing a particu], i Snina] 
sat soya art ea lea 
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the question of the validity of such a code does not arise, as in a closed, 
tight-knit, society, where norms tend to be undifferentiated. By a * rational 
moral code ' I mean one for which the individual sees that there are reasons 
which he sees could be otherwise than itis, which he follows more reflectively. i 
Now Kant, of course, made this sort of distinction when he contrasted the 
autonomy with the heteronomy of the will; and Piaget, both in his moral 
theory and in his theory of knowledge, has a Kantian point of view. What he 
did was to pour into the mould of this conceptual distinction a rich filling taken 
from observation of children at different ages. He showed that the distinction 
actually has application. 
Piaget studied the attitude of children to the rules of both marbles and 
morality and found a correlation between them.? At the ‘ transcendental stage ’ 
the rule appears as something external and unalterable, often sacred. At the 
autonomous stage it is seen to be alterable, a convention maintained out of 
mutual respect which can be altered if the co-operation of others is obtained. 
Constraint is replaced by reciprocity and co-operation. A lie is no longer just 
something ‘naughty’ which adults disapprove of, a prohibition which goes 
very much against the ego-centric wishes of the child, a command stuck on to a 
mental structure of a very different order from that of the adult, whose letter 
has to be adhered to, but whose intention is incomprehensible. It now becomes 
an action which destroys mutual trust and affection ; truth-telling becomes a 
rule which the child accepts as his own because of the reasons which can be 
given for it. To summarize the main features of Piaget's contributions : 
(i) Piaget insists that there is something of the sort. which Kant described 
as morality proper, distinct from custom and authoritative regulation. y 
(ii) Piaget assumes, So it seems, some process of maturation. There is a 
gradual transition from one mental structure to another. ' . 
(iii) He assumes that this development in the child’s attitude to rules is 
parallelled by his cognitive development in other spheres—€-g-, the grasp of 
logical relations, and of causal connexions. 
zed both for what he did do and for 
of criticism should be included 
those of} F: Spe who bage dae es that X pne 
i included under the concep of ‘ autonomy ' which are no 
soris ¢ dine at which also have not all a high mutual correlation. And, of 
prs "mueh in general could and has been said about Piaget's methods of 
investigation. — . z, are criticisms of what Piaget did not do. For 
More interesting, how? vestigated whether the transition through the 
ity is just a matter of maturation, or whether it depends 
ily or educational traditions. In a lecture which he 
that value judgments become 


elopment depends on general features 
t to establish this in any detail. E. 


b) Comments.—Piaget can be critici 
‘Amongst the former type 


more equita 3 
ut he makes no attemp 1 
of the cule estigated this question and finds a correlation between the sort 
d the social status of the parents and 


hich Piaget outlined an het : parer 
é -ercive techniques of € ild-rearing were employed. 
E to ind Taba! also studied the moral attitudes of adolescents in a 


À ji è i Indian tribes and tried to relate 
i n and of children in six American 
sii Wen he ds of training, cultural factors, etc. —. 

Such studies are suggestive but very inconclusive. From them three very 


d be made about Piaget's work. 


general observations coul 
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(a) Piaget's distinctions provide a useful framework for research. The 
details, however, of this descriptive apparatus need clarifying and tightening 
up—e.g., his concept of the autonomous stage. 


(b Given that some such transition sometimes occurs, much more needs 
to be established about the conditions which favour or retard it. These would 
include a variety of social factors, but of particular importance would be the 
techniques for passing on the rules of a society. As far as I know, Piaget says 


nothing about the ways in which parents and teachers help or hinder children 
in the transition to the autonomous stage. 


(c) Piaget says nothing of the extent to which the relics of the ‘ trans- 
cendental stage ' persist in the adult mind, and the conditions which occasion 
a complete or partial failure of the transition to autonomy in a society where such 
a development is common, and encouraged. 

As Freud, so it seems to me, said a great deal in an indirect way about such 
matters, it is appropriate to pass to his contribution. 


IIL.— FnEUD's CONTRIBUTION. 


, Piaget. I have stressed, explicitly made the distinction between conven- 
tional morality and following a rational code. In Freud this distinction is only 
implicit and the features of a rational code are not explicitly sketched. 

, Philip Rieff, in his recent book,5 makes much of what he calls Freud's 
ethic of honesty and of his uncompromising egoism. He suggests that Freud 
à K ankness that is a necessary procedural 
requirement for psycho-analysis. A man must admit his nature, be quick to 
detect dishonesty and sham in himself and in others. He must accept his 
uspicion of ‘ moral aspirations’ such as Freud 
ter « with middle class women at the turn of the 
century. Freud's ‘ education to reality’ and ‘ the primacy of the intelligence,’ 
which he explicitly advocated, go no further than what should be called prud- 
per as Piaget understood it. For Freud, on 

vie nt of Psychological man, the trained egoist. 
Rieff $ account of Freud the moralist, is interesting not only in stressing the 
cool rationality of Freud 5 own moral outlook but also in giving a certain 
interpretation of it. This interpretation is highly disputable. For, although 
Freud described the principle of impartiality or justice as a cloak for envy,’ 
sense of justice and consideration 
taking advantage of them I can 
"19 This looks very much like the 
as one could find in Sidgwick—or, 
confession of a man who believed 


measure myself with the best people I know, 
confession of a rational Utilitarian code such 

c It does not sound like the 
only in prudence or rational egoism. 


It is, th i r " 
ration d dm a uu to say whether Freud himself subscribed to a 


prudence attributed to him b Rieff S A a 
article, it does not matter. For sihat ees from the point of view of this 


promptings and precepts. 
distinction between rational : 
and about the desirability of rational behavi, eo oe 
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to reality ' intelligible and his therapeutic practice square with his theory about 


morals. 

As a matter of fact the distinctions which he made betwe 
] " S 5 Wi en the e - 
ego, and id, were ways of making the distinction between behaving Saat 
and behaving in other ways. He equated the ego with reason and sanity ; he 
ays it tests correspondence with reality’, ' secures postponement of motor 
ischarge, l : * When we ignore the 
pictorial model which Freud's concepts suggest, we see that their main function 
count of facts, 


being at the mercy of the super- 


irrational promptings of past 
ind this basic conceptual framework we can see, roughly 


If we bear in min 
speaking, that Freud’s contribution to moral psychology 


headings : 

i) In the theory of the id and the unconscious We 
typical occasions when we are deflected from conscious aims, when we cook 
up rationalizations as a cloak for following our jnclinations, when we act 
unaccountably ' out of character, and when we seem to suffer from à general 
inability to decide between different possible courses of action, to control our 
impulses, oF to carry out intentions however well-meaning. Such investigations 

t light on what might be called the executive side of moral action. 
do not intend to say anything more about this side of Freud’s work, in spite of 
its intrinsic interest. I propose to concentrate on the extent to which Freud 
deals with defects on what might be called the legislative or judicial aspects of 

niss with the sorts of rules which 


This implies something an e we apply 
ith the inability t Jes fit particular cases. Suc 


defects are to be found. 
(ii) In Freud’s theor 
(iii) In Freud’s theory 
These I now propose to discuss. 
The Superego he extent to obvious et unt of me bere 
" » i a 
theory of the super-ego 1S he Ts dope 


findings about the € 
d, it might be argued, went further and 


find a description of 


o see when ru 


y of the super-ego. 
of character-traits. 


identification, and reaction- 
osed sacrosanct commands 


ion, by me s 
formation, PY rized by the child and the standards : 
i ld explain the perfectly familiar 


come to be interio: 
with whom identification takes place. This would ex f 
-e of standards being passed on from generation to generation by contact 
i «ert some kind of discipline and provide 


procedur eo S 
i liest admired figures who exert son 
i ght be commented, stressed the 
he took for grante a way of passing 
hich, yas prevalent ina atriarchal 
on rules yi he father exerted discipline in an authoritarian abes with a 
lar of conduct. Freud's 


s taken as the exemp 
tive notion of taking the voice of prohibition 


eory look 
n elves and the more pos 


mired figure- 
r an at Freud’s theory of the super-ego does not altogether 
tation. Even in the matter of the authorita- 
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On the one hand, he was interested in the fact that some children develop 


y their parents; on the other 


miliar feature of types like the arrogant or humble 
man who have a picture of themselves, an_ego-ideal—what Adler called a 


viewed as being more or less constant. The Over-conscientious child is one who 
has turned his aggression inwards ; 


the ego-ideal is the product of narcissism or 
outgoing love turned inwards. The obses 


character, 


lowing Prevents the development towards a 
more rational way of following rules such as was sketched by Piaget in his 
account of the autonomous stage 

In his account of the Super-ego, therefore, Freud's theory is an important 
Supplement to that of Piaget. For Freud tries to explain why it is that people, 
i ge. Some get stuck as a person 

I less way to the standards that are passed on ; 
others exaggerate or distort the standards in question and are incapacitated for 
rational tule-following later. Of course, in all of us there is left what Freud called 
a ' precipitate ' of parental prohibitions, Following rules in a rational manner 
can only be viewed as the end-point of a continuum. My point is that Freud 
i tions for people who got 


ai, and the genital sta; es. The t i 
by the child to regulate his Sateen 7 by 
Freud suggestedis that cleanli 


against wishes for organ 


parental pressures. Similarly Sarcasm, sc 


I have often been Puzzled ab 


out what F 
this theory. Was he explaining Bie 


ud thought he was explaining in 
the development of what we call character ? 


Áa 
B_n 
— 
ei se 


$^ 
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Have the carefulness of the calculating determined busi 

A b T usine: thi is i 
accordance with the logic of the situation, and the ee mes 
js not, a similar explanation, the difference being only one of degree ? His 
theory was that anal character-traits are produced by fixation at this stage of 


sexuality. Did he assume that those who are not fixated but who pass through 


this stage in accordance with some assumed norm of development, go forward 


with a minor imprint of orderliness or obstinacy stamped upon them ? 

In order to get clear about what Freud’s theory explained it is first necessary 
to make some distinctions. By‘ character ' we can mean, first of all, the sum 
total of a person's traits like honesty, considerateness, punctuality, and so on. 
We speak of giving a servant a ‘ character’ in thissense. Presumably, too, the 
famous Character Education E nquiry of Hartshorne and May was concerned with 
the investigation of character in this non-committal sense. For little more was 
investigated than the incidence of traits like honesty, deceit, and so on. 

on the other hand, talk of types of character, they 


usually mean more than just the sum total of traits. They see a certain arrange- 
ment of traits which are usu d to one, as in the case of the 
penurious man. Alternatively, a whole range of traits might be displayed in a 


distorted or exaggerated manner, as in a pedantic person. 
a ing character ' in a third sense, when we 


speak of a type of consistency which is imposed on o 
higher order principles suc! dence or justice. Roback, in his 
classic on the psychology of character! spoke of character in this third sense. 

what he said about the function ofthe self-regarding 


So also did McDougall, in í : 
pt of the strong ego is very similar. i 
itsi i in the context 


sentiment. Freuds ounces Wes 
Now Freud's theory © charac c 

of characterology. Indeed, Ernest Jones speaks of his Character and Anal 
Erotism as a contribution to, this sort of speculation 
style. He was surely explaining men who had types of characte’ 
certain traits were 50 dominant that they provided a unifying pattern. And 
explains the connexion between certain types of character and 
ionals, he claimed, manifest exaggerations bs anel fria p 

i i ‘al traits. Freud, I suggest, Was per ectly familiar wit 
schizophren ie do okor ho crops up over and over again in characterology. 
i this pattern of traits 


His genius gsigning a common cause to both. He was put on to 
and t type E dings o n the Theory of Sexuality which just 
is e fin en 

guetedbd his paper on characte acco only for obvious types of 
Hu. type Le: aucertain extent, fort i an who always tends 

2 E optimistic 
to be a bit more cautious OT me bise of the judge, 
©, decisions like cople learn 


standards governing tion is that 
examiner. The usual assump "they gradually manage to master the know- 
ts about such matters. We do not 


kind of a renticeship, s z E 
how bei or making rational Judge t all cases of caution in business, or 
have to tell the Freudian oe n f sentence. But when we notea 
rigour in mar 4 every the other, when there is a consistent deviation in 
characteristic bias 00° M p fa begins to look as if the Freudian type of explana- 
one sort of dimension E ! 
ion i y hat the evidence 1s for Freud's specula- 
tion cs might well be a ith which I am not here concerned. For 
d . another m E tion as to what the theory Was meant to 


tionsanyway. Tha inno ect 
it surely presupposes the p? 
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explain. I have tried to show that it is to be seen as e 
character, not people who have character. Freud's co 


nations of why people do not attain the rational 
level of rule-following. I hav. in his implici i 


e with principles of prudence, impartiality, respect 
for others, and so on, In hi i xplains how it is that people 

ecause they get fixated at some earlier period and 
develop a type of character o 


come to adopt rules in an 
t the period of Super-ego formation—Piaget’s 
transcendental stage. 


of child-rearing and the 


y about techniques 
ment, which Freud tre 


ur rational develop- 
ats almost as a 
ance, certain t 


Freudian theo; 


A ra j European society. A 
the moment engaged uo” lines in America!? and A N. Oppenheim is at 


n ambiti q 
empirical knowledge abou MOUs Project to 


: : in our 

It is prett intl t our own Society, remedy just this defect in ox 
appealing to A nitoning qu, Po Petit pur ignorance on such matters by 
behaviour of animals qud nd to far-fetched analogies between the 


e 

passing on rules to children " i CUM aire aly quite different ways of 

Rie : Ot which Conditioning ; al 
confusion is only increaseq if these differen 8 1s only one. So conceptu: 


ways of passing on rules are not 


© 
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distinguished as a imi i igati 

1 preliminary to investigating what t i i 
different techniques at different ages. The type of hiec nt 
used on animals, for instance, must be distinguished from that form of Gone 
à Whi and p promised and threatened and not Jost 
a ministered. This, in its turn, distinguished from the use of authorit 
which presup poses e voice, but, in giving commands rather than in 
promising and warning. s imitation, suggestion, and learning ‘ by 
experience ' as well as rati instruction. Given such initial Eare e 


questions like this could be asked : 
en used at early ages where rational ex 


passing on rules, whi 

eei are out of the question, incapacitate the 
at later ages? If Piaget is right in saying that there is a correlation betwee 
child's attitude to the rules of marbles and to moral rules as he develops, hs 
s of learning rules which, in general, 


any generalizations be ma 
help or hinder the development towards the more ration 
land, reported experiments Ww 


when he recently visited Eng t 


on the use of different techniqu' 


ages. He has been studying, tl 
instruction with an explanation, and so on, 


Could anything be done 
lines? Do some te i 'g., conditioning—lea 
and compulsiv i 
rather than help to develop 
colourless conformity 9 
transcendental stage ? 
V.—DEVELOPMENT or FREUD'S THEORY. 
These omissions in Freud's theory are connected with a more general 
weakness—the lack of any detailed theory of the development of the ego, of 
rationality in general of which morality is a particular case. In the main those 
who have followed him, like Melanie Klein?!, have revised some of the details of 
his theory ab out the development of the super-ego—for instance, the age at 
^ Others like Horney”? and Fromm? have criticized his 
t is too biologistic and takes t jf 


d to break-downs, blocks 
at later stages? Do they encourage types of character 
at sort of training leads to the 


really emerges from Piaget’s 


which it is formes. 5 : 
theory on the grounds that 1 ) f oo little account of 
r that it treats infantile sexuality and the mechanisms 
determinants of, rather than occasions for, the 


flerences, OF *: 
with it as O°” 1 
eral orientations. 
/hich are particularly 


sence which is k 
conscience 7 t object of a child’s love. 


hati other, 
hating the m d the bad are otter E 
d by his own efforts or omissions, 


h s the be nning of 3 2 

eae a js to Ae made of Money-Kyrle s genetic speculations, which owe much 

to Melanie Klein, his theory 15 interesting In that he sees the connexion 

between rationalit and what he calls the | humanistic ' conscience and in 

that he tries to connect different types of ‘ conscience '—e.g., that of the 
e hypomanic, and the hypo-paranoid, with different types of 


obsessional, B® — nal 
m rationality- 
of Freudian 


more relevant line 


falling off fro . 
nere is, however, à second and even mo € 
nted b Erikson,” Hartmann, and Rapaport," who lava 

mous ego which was rather embryonic in 


thought represe y 
eloped the theory of the autono 
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Freud’s writings. Erikson has tried, in very general terms, to map the develop- 


ment of the autonomous ego and to make generalisations about conditions 
which favour or hinder it. R. 


which does justice to the autonomous ego ^ 
the development of autonomy, and to connect his the 
Such as those about stimulus deprivation, hypnosis, 
Piaget's views on the origins of intelligence in children. 


development what is needed more than in 
PSychology is a combination of concrete investigations 
with conceptual clarity. And this is the point from which my paper started. 
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METHODS FOR ASSESSING THE VALUE OF 
EXHIBITIONS 


By D. SHEPPARD 
(The Social Survey, Central Office of Information.) 


Summary. Some who design or use exhibitions to convey information are 
not aware of the research done on their effectiveness. A discussion is given of 
the various methods which can be used for such studies and a hypothetical 
example is given to illustrate their use. 


I.—INTRODUCTION. 


WnuILE the enthusiasm for visual methods of teaching has by now somewhat 
died down, this is probably due more to the dictates of fashion than to the 
results of experimental tests on the value of such methods. Nevertheless, 
quite an amount of data has been obtained on this topic, not only concerned 
with exhibitions, but also with visual methods for teaching skills (Vernon 1946, 
Laner and Vernon 1957) and with the effectiveness of diagrams and graphs of 
the kind used to illustrate written factual reports (Vernon 1950, 1951, 1952). 
Exhibitions will be the main concern here, using this term to include 
museums, government exhibitions, displays at agricultural shows and exhibi- 
tions on special topics prepared by commercial organizations, or by learned 
societies. As stated, quite an amount of research has been done on exhibitions, 
but because of the diversity of exhibitions studied, it is difficult to form any 
generalized conclusions from them (Sheppard 1959). It follows that a discus- 
sion of the methods used should be valuable, since the exhibit designer will have 
to conduct his own studies if he wishes to assess the value of his efforts. It is 
relevant to note that such handbooks as are available, on ways of preparing 
exhibits, seem to be founded less on the results of research than on the opinions 
of ' experts '"—opinions which are sometimes demonstrably incorrect. A hand- 
book which does seem to have taken some research on these topics into account 
is one prepared by the Food and Agriculture Technical Information Service of 
O.E.E.C. (1958). . ! LA 
Before discussing methods for evaluation, differences between exhibitions 
should be considered. A main difference is in their purpose: a museum, or à 
government-sponsored exhibition, is usually designed to educate, while a 
commercial exhibition is usually intended to induce some particular action such 
as the purchasing of goods displayed. Some exhibits have a function lying 
between these two extremes, as for example displays prepared by the Ministry 
of Agriculture, Fisheries and Food, which are intended to advise farmers on 
techniques, although not to sell them specific branded products. Few exhibi- 
tions are intended only for amusement, except those on piers at the seaside. 

Ain difference is in the type of visitor. Exhibitions such as the 
Win are intended ior the general public, while exhibitions 
Such as that held by the Physical Society are intended for specialists. Museums 
and agricultural shows serve both the specialist and the genera] public, although 


schemes such as charging higher entrance fees for certain days may be used to 


segregate these two groups. — i P 
i a more difference is in the duration of such exhibitions. A display in a 


i for years, while an agricultural 
may remain unchanged years, 
misce qi ot Royal 7 lasts only for four days in each year. 
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* Turning to methods for evaluating exhibitions, it will be best to list these 
and to give some discussion of the merits, or limitations, of each. An example 
will then be given of how a particular exhibition might be studied. 


IL—THE METHODS. 


Something can be learnt about the value of an exhibition by counting the 
number of tickets sold, the number of catalogues sold, or the number of leaflets 
taken away. General indications of such a nature can be useful when the 
exhibition concerned is a regular event, to see if it is becoming less, or more, 
popular ; to give ideas as to the best site to use, or to compare different ways of 
advertising it. To make more detailed deductions from such evidence is 
unwise, an example being a report by Knee (1953), in which he states that the 


value of Ministry of Agriculture stands at shows can be deduced from the fact 
that 60,000 technical bulletins were sold to farmers 


than one. Furthermore, while such a 
the bookstalls, it provides no evidenc 
exhibits at these stands. Countin; 
can be useful. Thus counts can be made, at intervals, of the 


[ but this was because 
the bookstall was, on the average, no greater than for otl “uae 


Our ce h ) her exhibits. 
urther method consists of observing individual visitors a , 

the exhibition. Thus the observer may note which exhibits der anne 
the order in which he looks at them and the time he spends at each. Such 
observations have been made in a number of investigations (Marston 1925 
Robinson 1928, Willcock 1949a, b, Orlanksy 1951, Brooks and Merion 1956. 
Sheppard, Smith and Carrinci 1960), one Suspects in some cases because the 
Investigators were apprehensive about visitors' reactions to bein à stioned 
The method is useful and objective in the sense that there should b tem ticall j 
no subjective element in the data obtained, the observers’ role } icine Drs 
more than that of a ciné-camera. S here being little 


e The visit & 
trusively, so that it can be assumed behavian usually be observed unob- 


É I he is behaving nz 5 A 
on this at agricultural shows indicated that only 3 ae ap n Dc 
. visitors w 


had been observed had noticed that this was bei 5 

three main disadvantages : a long time is rupi it M oe Pure is 
not always possible to collect data for any considerable num] SEO S0 thats 
provided as to the visitors occupation, which can be j "isi No evidence is 
investigator is primarily interested in a particular Section s ent where the 
examples are the agricultural show Studies, where th the visiting public: 
farmers. Finally, the method gives no indi e main interest was in 


even though it may be possible to say dee M ; "E effect of an exhibit, 
time than is needed to read or see all that is displayed th spent more (or less) 
can readily be obtained for this Purpose, by timin ere. “Standard ’ times 
instructed to see exhibits properly (Willcock 194 & people who have been 
Carrinci 1960). 9a, b, Sheppard, Smith and 
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A variation on this technique is to observe visitor behaviour at a few selected 
exhibits of particular interest to the designer. Less time is required per visitor 
if this is done, though it is not then possible to make a generalized statement 
about all the exhibits, nor to compare the attactiveness of the exhibits studied 
with all the others. Otherwise, the same advantages and disadvantages 
apply. 

A fixed camera can sometimes be used instead of an observer, when the 
exhibition is concentrated in a fairly small space and is reasonably well lit 
(Nielsen 1946). It then becomes possible to make detailed studies, unob- 
trusively, of several visitors at a time. However, at crowded periods, it may 

rove impossible to identify and trace the course taken by selected individuals 
(Nedzel 1952). This could well bias the results considerably, since it means that 
no such data can be obtained at times when, ideally, larger numbers of visitors 
should be included in the sample ; these would also be times at which visitors 
would be expected to behave differently, just because the exhibition is crowded. 

This leaves interviewing to be considered. There is little need to be 
apprehensive about doing this, for visitors approached at exhibitions rarely 
refuse to be interviewed. Refusal rates vary from 2 per cent. to 7 per cent. 
(Sheppard, Smith and Carrinci 1960, Orlansky 1951). Interviews need not be 
confined to the exhibition itself, and studies of visitors before they go to an 
exhibition, or after they have left, can provide very valuable data. 

Only by interviewing can one assess the effect of the exhibition on the 
visitor's attitudes, interests, knowledge or activities. On the other hand, the 
assumption is implied that the visitor usually knows what he has seen and what 
effects the exhibition has had on him, and that he is prepared to be quite frank 
and honest about such matters when he does know. The possibility always 
exists that he will seek to justify his actions in coming, or that his responses will 
be influenced by his judgment of the status of his interviewer. 

About the best method so far devised is to observe visitors as they look at 
exhibits and to follow this up by interviewing the same visitors as they leave the 
exhibition (Willcock 1949a, Orlansky 1951, Sheppard, Smith and Carrinci 1960). 
This method can provide information about all exhibits seen by a visitor; à 
visitor's statements about what he saw can be compared with the results of the 
observations, and all such data can be related to his occupation or specialized 
interests. Some of the disadvantages which apply individually to observations 
and interviewing still apply when these methods are combined, while this 


technique has the added limitation that it takes an even longer time for each 
visitor than straight observations. 

There are two further methods which, because of the sampling inadequacies 
inherent in them, cannot be used for a serious investigation. The first consists 
of noting remarks made by visitors who are looking at the exhibits and the second 
consists of asking questions of visitors who, on their own initiative, approach 
a demonstrator for help or advice. In neither case are those concerned likely 
to be typical visitors. However, such methods may be useful in a pilot investiga- 
tion, when searching for clues to be followed up in a later and more extensive 
study. . i NS 

Sampling is a key problem in all such investigations, and a brief discussion 
of this is necessary. Random sampling is rarely feasible, but a method which 
can be used is that of sampling visitors at fixed intervals. Thus, counts may be 
made of visitors during every alternate half-hour period, or every third visitor 
to leave the exhibit may be interviewed. With observations or interviews, 
however, it is difficult to use this fixed interval sampling method when the 
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attendance rates vary considerably with time of day. The most usual alterna- 
tive procedure is for each observer to select, say, the third person to enter the 
exhibition after the last person who was observed has left. Interviewers may 
follow a similar procedure (Willcock 1951). If counts of the numbers of visitors 
who see the exhibit are also made, the data obtained can be weighted so as to 


the observer's, or interviewer's, Prejudices to influence the choice of visitor. 


One special problem arises when the survey is to be concerned only with a 
particular group of visitors. I i i i 


i at the original method 
one, the irrelevant data can be thrown away 
i ence that this group of 
Although such a sampling 
y, it can be frustrating when 
visitor who is not a member of 


alization will have to be made from a 
ns that averages are then subject to a 
able in such studies, for to use 


IIL—Ax EXAMPLE, 
Suppose that a commercial firm wishes to find out if i 
held at an agricultural Show, is ' effective,’ Mi. standy 


1 ' * Suppose further that the firm 
concerned is generous with funds and, despite pressure, those responsible refuse 
to define What is meant by effective,’ They have however, decided on what 
their exhibit is to be about and on how to à 


: thus, attempts 
Sitors visit the 


; how many can ri 
; ecall what Was 
many make a purchase as 4 result. shown, some weeks later, and how 


Having persuaded the firm’s representatives to 


farmers and to limit the area they care about to the e irs MEANS, as 


is held, it is decided to do 4 sample sury county in which the show 
the show is over. This will tell fairly precisi dom Sample of farms, after 


number of farmers in 


+ ————— —— 3 


C E e LT ———— a. ee ee 
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this area went to it. However, if few went to the show, such a survey would 
provide little other useful evidence except perhaps some reasons given by 
potential visitors for not going. 

The investigations have to be planned before the show takes place, so the 
possibility must be considered that this sample survey will produce few farmers 
who went to the show, fewer still who attended the firm’s stand, and almost no 
one who purchased the firm's products. Consequently, it is decided to do some 
studies, at the show, of visitors to the stand, using sufficient investigators to 
allow a small number (say five) to be on duty throughout the day. One is 
detailed to make counts of visitors entering the stand, two to interview visitors 
leaving the stand and two more to observe visitors going round the stand and 
afterwards interview them. 

The counts will provide data on the number of visitors, and the observations 
followed by interviews will show with certainty what exhibits were seen by 
farmers. The proportion who learn anything from the exhibits can be assessed 
by asking the visitors to recall what they saw. More data on learning can be 
obtained by examining the larger number of interview schedules completed 
by those who are interviewing only, for these will have asked visitors which 
exhibits they saw, and what they learnt from those they claimed to see. Some- 
how, an attempt will be made to allow for the fact that some visitors will not 
really have learnt anything, but will have been able to guess what was on the 
exhibit from their general knowledge of the topic. It will be important to 
know this if any appreciable number of visitors are able to say what was on the 
exhibits though, of course, it will matter little if only 1 per cent. or so appear to 
have learnt anything. 

Since addresses will have been obtained from the farmers interviewed, it is 
an easy matter to call on them, or perhaps write to them, later, to see if they have 
since purchased a sample of the firm’s products. If they have done so, they can 
be asked if this was a direct result of seeing this product exhibited, though that is 
not to say that such a question can always be easily answered. The farmer 
concerned may not know precisely what influenced him, or his purchase may 
have been a result of cumulative influences. 

The counts of visitors will provide some basis for re-weighting the data, 
both to allow for the variability of the sampling fraction and to enable percent 
ages to be expanded into absolute numbers: the firm’s main problems can 
thereby be answered satisfactorily. Let us suppose, however, that the results 
turn out to be disappointing. The firm, therefore, wants to know why only 
3 per cent. of those who came to their stand purchased one of its products. The 
data obtained show that their exhibit failed because many farmers did not learn 
what was shown—but was this because these exhibits were incomprehensible ? 
Likewise, those who learnt what was shown did not put into practice what they 
learnt—was this because they did not believe in the firm’s recommendation, or 
because they forgot what was shown ? 

It is now too late to measure comprehension at this show, but perhaps at a 
future show an investigator could accompany certain visitors as they go round 
the exhibits and ask what they understand from them, or a ‘ classroom experi- 
ment ’ could be done, showing such exhibits to a sample of farmers, and match- 
ing their characteristics with those of farmers who attend shows, by methods 
such as the B.B.C. have used (Belson 1956). This would show if a particular 
exhibit could be understood by most farmers. If so, the amount of time needed 
for full understanding could be compared with the amount of time spent in 
looking at this exhibit under typical show conditions. An alternative method 
which has been tried, consists of asking visitors to shows if they think that the 
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exhibits could have been made more easy to understand. Visitors tend to 
regard this as a potential slight on their intelligence, and not as a reflection on 
the exhibit designer’s skill, so that data thus obtained are of little value 
(Sheppard, Smith and Carrinci 1960). 


Belief is more troublesome to investigate. Satisfactory data on this can 
scarcely be obtained at the stand, but an (apparently) independent survey at the 
farms concerned might throw light on this. The situation may be complex, for 
the firm itself may be judged to be untrustworthy, or it may be regarded as 
trustworthy but the evidence adduced to prove the merits of the products may 
be judged unsound or insufficient. 


Long-term recall is, of course, important. A disadvantage of assessing 
recall at the stand is that this gives an indication only of the maximum number 
who will be able to recall something from an exhibit later. If recall is tested at 
the time, it is not satisfactory to try to do repeat tests later on the same groups, 
as Bloomberg (1929) did. If memory is not tested at the show, but the addresses 


of the farmers are obtained, a survey could be carried out to test long-term recall 
among the farmers concerned. 


A commercial example has been taken here and it may be queried if such a 
study. would be typical. In fact, these methods could be applied to almost all 
exhibit evaluation studies ; what is unusual in this example is that the aim of 
the exhibit could be unambiguously defined, in terms of sales, and its success 
thereby made easier to assess. At museums the aim is to educate, and the 
investigation could be concluded at the stage of long-term learning. A difficulty 
arises in defining what should be learnt, for the exhibitor's ideas of what are the 
main points of an exhibit may well not be broad enough to include all that some 
visitors could learn from it. In fact, something may be learnt which a ears 
quite irrelevant to the main purpose of the exhibit: to illustrate this beer y 
quote a remark made by a farmer who saw an exhibit featurin live ttle, 
intended to show the value of different feeding methods. The farme ENS 


quite happy because he had seen an exhibi i t ie 
de-horn their cows.’ Ibit which showed that ‘ the Ministry 


IV.—Conciustons. 


The data discussed here make it clear t} 

; 3 hat the Ea 
used to examine the effectiveness of exhibits, one y mich can be 
may allow generalizations to be made about the value of ex ibit of this nature 
ways for improving them. X^übitions, and about 


Some who use exhibits seem to despai i 
. r 5 air of > s 
much evidence for this one way or the dim Ae un, bon often without 
studied more fully, if only so that their effectiveness ca um ibits deserve to be 
of other ways for transmitting inform SS Can be compared with that 


J ati ; 
compared with other sources of monde Ms ee Cost of exhibits as 
Should also be consi 

nsidered. 
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SUMMARY OF RESEARCH REPORTED IN DEGREE 
THESIS 


A Study of the Nature of Spatial Ability and its Relationship to Attainment 
in Geography. 


By C. COLIN TAYLOR 
(Abstract of thesis submitted for the degree of M.Ed., University of Durham, Oct., 1959.) 


T.—AIMs OF THE INVESTIGATION. 

(a) To investigate the relationship between spatial ability (‘k’ factor) and 
(i) the ability to perceive and reproduce the proportions of a figure and (ii) the 
ability to perceive and reproduce details. 

(b) To investigate whether spatial ability (' k ' factor) is related to attainment 
in geography. 

The first part of the study originated in a suggestion made by Macfarlane Smith? 
that spatial tests which depend for success on the perception of the form or structure 
of a figure (proportion) tend to have higher k-loadings than those which depend on 
the perception of details. The second part was intended to be an experimental 
examination of claims made by previous workers that spatial ability is important 
for success in geography. 


Il.—Tue Tests UsED. 

The following twelve tests of ability or attainment were administered : 

Test 1—Moray House Verbal Reasoning Test (Series 51 [52). 

Test 2— Non-Verbal Test 3 (N.F.E.R., Calvert). 

Test 3—Spatial Test I (N.F.E.R., Smith. Total score in sub-tests 
involving little imaginative manipulation of figures). 

Test 4— Spatial Test I (Total score in sub-tests 1 and 6. 
much imaginative manipulation). 

Test 5— Clerical Test 1 (N.F.E.R., Richards. Total score in Sub-tests 1, 2 and 
4, involving perception of detail). A 

Test 6—Score in Geography, obtained by standardising the marks in school 
internal examinations to a common mean and standard deviation, 

Test 7— Objective test of locational geography (in map form) 

Test 8—Objective test of geographical know 
form). 

Test 9—Objective test of geographical distributions 

Test I0—N.LI.P. Test of Memory for Designs. 
original marking scheme. 

Test 11—Score in Test 10 combined with a Drawing of a Man Test. Both 
marked for correctness of detail, i.e., the number of items or parts which were 
perceived as making up the composite whole. This was intended to test an ability 
to perceive details. : 

Test 12—Score in Test 10 combined with a Drawing of a 
marked for correctness of proportions. The perception of prop 
regarded as the perception of the parts of the design in relation to the whole, i.e., a. 
due relation in size between the parts and the whole had to be perceived nie 

The marking for correctness of proportions was carefully carried out by the 
writer, the measurements being made to one-tenth of an inch, x 

Tests 7, 8 and 9 were devised by the writer personally, 


2; 3, 4, 5, 


These sub-tests involve 


ledge and reasoning (in verbal 


(in map form). 
Marked according to the 


Man Test! Both 
ortions was to be 


III,—ApMINISTRATION OF THE TESTS. 
The test populatisn consisted of 100 third-year girls and 100 thi 
E th year 
both groups having an average age of 14 years 4 months. These th yarrow a 
secondary commercial schools in Newcastle-upon-Tyne, Reliabilite coefficients 
were calculated for all the tests (except where given in the manuals) d 
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These coefficients were usually high, seldom falling below :8, except in the case 
of the school internal examinations, for which the reliability coefficients were -457 
and -379, for boys and girls respectively. 

Product-moment coefficients of correlation were calculated between all pairs of 
variables, separate matrices being calculated for boys and girls. 


IV.—RESULTS. 

(a) Sex-differences. Mean scores for boys and girls were calculated for all 
variables, and critical ratios were determined to assess the significances of the sex- 
differences. These are shown in Table 1. Marked sex-ditterences (C. R. 72-58) 
occur for all the spatial tests (3, 4, 10 and 12) in favour of boys and for the verbal 
test (1) in favour of girls. 


TABLE 1 


‘Tue Sex DIFFERENCES FOR THE TEST BATTERY (+= Favours Boys). 


Sex Mean Score Difference of Standard Critical Ratios 
Means Deviations 
1 | Boys 71-17 —5-94 8-900 4-752 
Girls 77-11 8-770 
E: n 71:55 — 1-80 12-381 1:133 
G 73-58 9-951 
3 B 71-93 10-75 13-240 5-984 
G 61-18 2-143 
E B 26-14 4448 7478 4-956 
G 21-66 6-255 
CB | e 67-61 —4-04 14-748 1-865 
G 71:68 15-861 
6 B 49-73 —0-26 9-815 0-1876 
G 49-99 9-884 
7 | B 49-52 3-54 20-530 1-419 
G 53-06 14-187 
8 B | 31-22 —0-74 7-669 0-724 
G 31-96 6-757 
9 B 15:52 40-75 5-683 1-008 
G 14-77 4-808 
10 B 33-04 +3-12 3-477 
G 29-92 
T B 63-33 —0-67 15-534 0-324 
64-00 13-670 
8-005 


(b) Factor Analyses. Centroid factor analyses were carried out on the correla- 
tion matrices for both boys and girls (cf. Tables 2 and 3), the resulting unrotated. 
factors being interpreted in the light of the hierarchical theory of the structure of 
mental abilities.9 
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TABLE 2 


UxnorATED CENTROID FACTORS (AFTER ONE IrERATION)—CENTROID FACTOR MATRIX 
FoR 100 Boys. 


(with proportions of variances contributed by the centroid factors, obtained communalities 
and guessed communalities.) 


F 
g 
& 


I III? h? h? 
(obt'd) (guessed) 


:0232 -0763 -2651 — 4038 
:0112.— 0124 *4170 -5301 
“1985-1199 :9007 -9390 
1849 -1393 "8346 -9390 
0055 -0099 "2487 -4135 
:0463 -0122 :3207 -4558 
:3881 -0015 "8386 -8405 
:3786 -0138 "7647 +8405 
:2796 -0137 :7695 -8035 
‘0819 -1563 8664 -876G 
:0768 -1087 4652 -5876 
"0898 -1052 :8088 -8766 


1 E 
2 : 
3 T 
4 TO 
5 +: 
6 T5 
vi TC 
8 shes 
9 T 
10 TO 


dH ERE E IG IG 


des 


4-9564 1-7144 -8292 7-5000 


"EM % 66-09 22-85 11.06 100% 
Variance 


% Av. 41:30 14:29 6-91 


TABLE 3 


UNROTATED CENTROID FACTORS (AFTER ONE IrERATION)—CENTROID Factor MATRIX 
FOR 100 GirLs 


(with the proportions of variances contributed by the cen 


troid factors, obtained and sed 
communalities), en 


F 
g 
& 


h? h? 
(obt'd) (guessed) 


4662  .4694 
"6238 -6629 
"7745 -8913 
-8913 
4573 
75777 
:7464 
:7464 
:4938 
:7289 
-5439 
+7289 


O0 09 oto-| 


Variance 


x -m y 
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The psychological interpretation was confirmed by carrying out a group factor 


analysis, which gave more satisfactory results for the boys than for the girls. 


The following scheme was tentatively proposed to account for the groupings of 
the factor loadings : 


Factor I g 
| 7 | 
Factor IT vied k 
| --—— 
, | | ` 
Factor 111 m v s,(Perception s,(Mental 
(Rote (Verbal and reten- manipula- 
memory) facility) tion of tion of 
| shapes) shapes) 
| 
| | 
P. Ps 
(Perception (Perception 
of of 
proportion) detail) 


V.— CONCLUSIONS. 
(a) The Nature of Spatial Ability (‘k’ factor). 


(i) Test 3 (Sub-tests 2, 3, 4, and 5 of Spatial Test I) and Test 4 (Sub-tests 1 
and 6) showed marked sex-differences in favour of boys, but the most striking sex- 
differences in favour of boys was shown by the Memory for Designs Test, when 
marked for correctness of proportions (C.R.=8-0). i 

A small but insignificant sex-difference in favour of girls was shown by the 
Memory for Designs Test, when marked for accuracy of detail. Thus, marking this 


test for detail has produced a large ‘ swing ' away from the normal sex-difference in 
spatial tests in favour of boys. 


(ii) The factor analyses show that the three ways of marking the Memory for 
Designs Test all give rise to high spatialloadings. They also indicate that, both 
for boys and girls, the Memory for Designs Test has a higher ' k' loading when 


marked for correctness of proportions than when marked for accuracy of detail 


(thus confirming one of the hypotheses being investigated). 


(iii) Evidence was obtained suggesting that a sub-factor S,, involving the 
perception and retention of impressions of shape, contrasts with a sub-factor S, 
involving an ability to manipulate shapes mentally, both sub-factors entering into 
spatial ability. In the case of boys, the loading for S, is highest in the Memory for 


Designs Test marked for correctness of proportions, while for girls it is high 
this test is marked for accuracy of details. ghest when 


(iv) The analyses also suggest that boys are superior to girls with regard to 


sub-factor S, and that girls apparently possess greater ability thi Ñ 
involving sub-factor Ss. y than boys in tests 


(v) The results are consistent with the view that spatial ability (' k’ 
involves a capacity to perceive and retain mentally an impression aH din ee 
or form of a shape or pattern as a whole (cf?). Tests designed to emphasise this 
capacity show the largest sex-difference in favour of boys. p 


(vi) The most striking result of marking the Memo i 
accuracy of detail is a substantial reduction in the fel ets bags a. 4 
method of marking also caused the sex-difference to ‘ swing ’ in favour id ei z 
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(b) The relationship between spatial ability and attainment in geography. 


(i) There were no significant sex-differences in test performance in any of the 
geography tests. 

(i) The factor analyses provided no evidence that spatial ability (' k ' factor) 
enters into attainment in geography, as measured by the school internal examination, 
or by the objective tests Specially prepared by the writer. The loadings in the first 


bipolar factor indicate that scores in the examination and tests in geography are 
sharply contrasted with scores in the spatial tests. 


(ii) The four measures of attainment in geography had higher ' g ' loadings for 
boys than for girls. 


(v) Attainment in geography was highly dependent on verbal-educational 
ability and this dependence was even more marked for girls than for boys. 


VI.—REFERENCES. 
!GoopENouaH, F. (1926). The Measurement of Intelligence by Drawings. New York : 
World Book Coy. * 
2 SMITH, I. MACFARLANE (1954). 
1,5, 19-33. 
? VERNON, P. E. (1950). The Structure of Human Abilities, 


The development of a spatial test. Durham Res. Rev. 


London: Methuen, 


NOTES AND COMMENTS 


COMMENTS ON ARTICLES BY MR. D. G. LEWIS AND MR. W. R. JONES 


By J. L. WILLIAMS 
(University College of Wales, Aberystwyth) 


Reports of attempts by Mr. D. G. Lewis and Mr. W. R. Jones at comparing the 
attainments of bilingual and monoglot children in Wales may be of some academic 
interest and their disagreements can be regarded as somewhat amusing, but it is 
very doubtful whether investigations of this nature can ever make a worth while 
contribution either to the study of bilingualism or to the means of attaining a high 
degree of bilingualism among communities where such an educational aim has been 
accepted. Mr. W. R. Jones’ reference to " somewhat contradictory results ”” already 
echoes through very many investigations in many countries, and more variations on 
the same theme can hardly be expected to be anything but tiresome. 

The findings of both writers are bound to be inconclusive and of little value as 
long as they continue along their present lines of investigation. It appears that no 
significant step forward has been taken in the planning of such investigations since 
the pioneer work done by Saer in 1922 and 1923. Any advance made has been 
limited to the use of better tools and improved tistical methods. 

Investigators have consistently failed to isolate bilingualism as a variable. 
Even the importance of socio-economic factors has been ignored until Mr. Jones 
made a belated ' discovery ' of its significance in his latest publication.* This is 
strange in view of the fact that it has been known for many years that the scores 
of groups of children in various tests tend to vary in accordance with such factors as 
parental occupation and other background variables. In fairness to the two 
investigators and their predecessors in Wales, one should, however, note that many 
other research workers in the same field elsewhere have unwittingly committed 
similar sins of omission, with the result that their investigations too have failed to 
produce useful knowledge, let alone valid conclusions, concerning the eftect of 
bilingualism per se upon child development and educational progress. 

Such examples of non sequitur as Mr. Lewis's statement, ‘‘ One can conclude 
then that linguistic backrgound can be regarded as an important factor of the school 
differences in attainment in English which are not explained by differences in 
intelligence " are common in reports of investigations of this nature. Is it not 
possible that several schools “ drawing most of their pupils from Welsh-speaking 
homes," in his sample, were carrying out the language policy recommended by the 
Welsh Department of the Ministry of Education and, consequently, did not make 
any use of English in the education of pupils under the age of seven, and even in the 
junior school devoted much time to the development of command of the mother 
tongue (Welsh)? If schools in his sample followed what is the normal practice in 
Welsh-speaking areas, it would be much more reasonable to conclude that school 
differences in attainment in English and arithmetic arose from school language 
policy, and in particular from variations in teaching time and language contact under 
school conditions, rather than from differences in linguistic background outside the 
schools, in spite of the evidence provided by statistical work done by Mr. Lewis, Mr. 
Jones, and others. 

It is suggested respectfully, that the time has come for potential research 
workers who are anxious to tackle problems associated with bilingualism in Wales to 
be directed into more fruitful soil than that to be found in the often tilled, but most 
unproductive field in which intelligence, school attainments in English etc., at 
junior school level only have so often been compared without any consideration 
being given to variations in teaching time, methods of language presentation and 
usage, number of subjects taught, and to important variations in home background 
not, of necessity, linked with language spoken or even with parental occupation. 


* Bilingualism and Intelligence, Cardiff, 1959. 
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REPLIES TO COMMENTS BY J. L. WILLIAMS 


Dr. J. L. Williams objects to my conclusion that “ linguistic background can 
be regarded as an important factor of the school differences in attainment in English 
which are not explained by intelligence,” on the grounds that some, but not all, 
schools were carrying out a special language policy, a policy which could well 
account for the differences in test results. This possibility, however, had been 
considered from the very outset. The head teachers of all the schools were asked 
about their language policy, and a detailed statement was received from each school. 
No marked differences in the schools’ language policy were found to exist. In all 
schools, the medium of instruction (of the subjects tested) was English, Welsh being 
used to but a very small extent. All schools, too, devoted separate teaching periods 
to the development of Welsh, Minor variations in “ language contact under school 
conditions ” must be viewed against this general uniformity of practice. 

In an earlier paper on bilingualism and non-verbal intelligence* groups selected 
on the basis of linguistic background were compared in test performance. Each 


group contained children from all the schools sampled. Note was taken of relevant 
non-linguistic differences between the 
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partially sucessful, but, as every experienced worker in the field of educational 
research will know, complete isolation of any particular variable is often very 
difficult to achieve, especially so in experiments which are complicated by a multi- 
plicity of interacting human and social factors. 

3.—Dr. Jac L. Williams's complaint that no consideration has been given to 
variations in teaching time, methods of language presentation, etc., in comparative 
studies of monoglot and bilingual children will not bear examination for one single 
moment. I would strongly advise him to read i» detail my own researches which 
have been published from time to time as articles in this Journal and as a series of 
monographs by the University of Wales Press Board, so that he may see for himself 
that considerations of this kind were consistently borne in mind in the process of 
interpreting test results. 

4.—Finally, I would point out that, as far as Wales is concerned, the scientific 
study cf bilingualism from 1922 onwards has been pursued by one or two researchers 
as a rule working alone, whereas others, although equally well placed to undertake 
researches of a similar kind, have been content to do nothing other than indulge in 
uninformed and irresponsible criticism. I would respectfully suggest that the time 
has now come for Dr. Jac L. Williams and other ‘ potential’ researchers to cease to 
pontificate generally on the topic of bilingualism and to become active participants 
and practical workers in this most difficult field of enquiry. W. R. Joxzs. 


ON D. H. STOTT'S ARTICLE: INTERACTION OF HEREDITY AND 
INTELLIGENCE IN REGARD TO 'MEASURED INTELLIGENCE ' 


Educational psychologists will be grateful to Dr. Stott for raising afresh the 
far-reaching problem of environment versus heredity, and for his interesting summary 
of recent work in experimental biology. Unfortunately, however, his survey is 
limited almost exclusively to those publications which support, or seem to support, 
his side of the case; and, as readers familiar with his earlier writings will know, where 
intelligence testing is concerned he has strong and somewhat unconventional views. 
“ The legend that the intelligence test measures innate unalterable ability ," he has 
told us, “ was fathered by an earlier generation of psychologists fighting for recogni- 
tion, and mothered by administrators glad for someone else to carry the baby... 
Any clinic which abandoned intelligence tests entirely would not only be able to 
handle more cases, but increase its success rate many times."'* 

As a matter of history, intelligence tests were fathered by Sir Francis Galton 
and Alfred Binet (Professor at the Sorbonne) comparatively late in life. Neither 
had then to fight for recognition. In this country the earliest administrators 
who encouraged the use of such tests were the education officials of the London 
County Council; and in the first memorandum which they issued on the subject 
(a revised version of Binet's tests for use of teachers and doctors diagnosing defective 
children) stress was laid on “ one fundamental truth ; tests, far more scientific than 
those here set out, can still be but the beginning of the examination, never the end, 
of the child,"t In varying terms all of us in those days emphasized that ‘ intellig- 
ence,’ in the sense of innate general ability, was a hypothetical factor, whose amount 
could at best only be estimated, and that the marks or measurements obtained 
with tests formed merely one of the many aids to making such an estimate. Some 
writers like to distinguish the two concepts by using the labels ' intelligence A’ 
finnate) and ‘ intelligence B?’ (measurable). Dr. Stott uses the term ‘ intelligence ' 
to cover both, often without indicating which he has in mind. But no psychologist of 
repute has ever supposed that a crude test-measurement, taken just as it stands 
provides an infallible assessment of ‘ innate unalterable ability.’ ! 
. Dr. Stott, however, appears to hold that the whole notion of ‘ general ability’ 
is an untenable hypothesis. Our ability to measure Specified differences," he says, 


* D. H. Srorr (1956). Unsettled Children. pp. 27-8. 
iC. Burt (1921). Mental and Scholastic Tests. p. xv. 
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“ does not justify us in assuming that there is a unitary factor underlying them. 
Certainly not: the justification lies in the positive correlations (usually quite high) 
between such measurements—correlations which practically every investigator has 
confirmed. And the statistical inference that these are due to a concrete ' factor 
is borne out by a vast array of evidence* from other fields—experimental, introspec- 
tive, biological, and above all, neurological—evidence which Dr. Stott does not even 
mention, let alone refute. . 

But what Dr. Stott is chiefly concerned to dispute is the idea that differences in 
' measured intelligence ' are in any appreciable degree due to genetic influences. 
He begins by stating that the attempt to distinguish hereditary influences from 
environmental “ has been called in question owing to the demonstration of a certain 
correlation," between the two. Maddox, he says, has pointed out that " this 
correlation is due to the Sociological fact " that high intelligence in the parent is 
associated with cultural stimulation in the child. Maddox did not point out this 
causal process; he merely thought that such conditions might account for all the 
correlation between parent and child—a purely speculative suggestion. Dr, Stott 
apparently assumes that, when two such factors are correlated or interact, it is 
impossible to assess by statistical analysis " the relative contributions " of each. 
But Sir Ronald Fisher long ago showed how, by an analysis of variance, this could 
be done, and applied his techniques to the problem of heredity and environment. 1f 
Dr. Stott believes that Fisher's demonstration is faulty, he should at least explain 
where he supposes Fisher has Slipped up. 

Dr. Stott goes on to cite a series of ex 
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animals—notably among mice, Pigs, guinea-pigs, and fowl. In man therefore (so 
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stress. Hence, the natural result, he argues, must be an increased correlation 
between the intelligence of twins. 

But, as a matter of fact, in ‘ non-identical ’ twins the correlation differs hardly 
at all from the correlation for ordinary siblings (see this Journal, XXV, p. 168, 
Table 1). So here there is no indication of the causal process alleged. Dr. Stott, 
however, if I understand him rightly, assumes that there must also be some pre- 
existing genetic tendency which the pre-natal stress ' facilitates ' and that this is 
found in ‘ identical ’ (i.c., one egg) twins. Yet, if that is his explanation, then he is 
tacitly admitting that the genetic predisposition is the decisive factor. 

However, as numerous embryologists have pointed out, the uterine conditions 
are not the same for both members, since they cannot both occupy the same position. 
Usually, the second to be delivered (often still born) is the one that has sufiered 
most. The fact that ‘ identical’ twins develop within the same membrane as well 
as within the same uterus, would appear, if anything, only to aggravate the differ- 
ence. 

But do such conditions " make twins more similar in mentality " or lead to 
“ the production of definite defect ? " This after all is a question of fact on which 
there is plentv of data. In inquiries carried out in L.C.C. institutions and reported 
by Miss Conway, twenty-six cases of mental deficiency in an ‘ identical ' twin were 
discovered, and in twenty-four of the cases the other twin was sof defective. In these 
institutional cases the deficiency was of a low grade pathological type, usually 
accompanied by physical symptoms, Outside institutions, where the deficiency is 
as a rule merely an ex: ive deviation below the general average, of a kind fitting 
in with the normal distribution, the correlation with physical signs was negligible. 

These (ndings are fully in keeping with those reported by most other investiga- 
tors. For example, in a recent survey, Berg and Kirman have tound that, in forty- 
nine cases of mental deficiency occurring in a twin, the other twin was defective in 
only four cases.* ‘Thus, so far as intelligence is concerned, the type of causation 
invoked in Dr. Stott's rather speculative arguments pro to be the exception 
rather than the rule. Certainly, the meagre amount of evidence that he himself 
adduces can by no means sullice to overthrow a conclusion accepted by such 
experienced geneticists as Fisher, Mather, and Darlington, and indeed, if I am not 
mistaken, by almost every leading authority in this country. CYRIL BURT. 


REPLY 


Sir Cyril Burt must have misunderstood the theme of my article. I did not raise 
the problem of environment versus (repeat versus) heredity. This is precisely the 
dichotomous approach I tried to show was unreal. We are dealing with comple- 
mentary conditions for development. Up to a certain point heredity /environment 
is a useful way of conceptualizing two main classes of necessary conditions. But to 
polarize them into a dichotomy is epistemologically primitive. Equally primitive is 
the vitalism of calling them ‘ influences,’ making them conceptual gnomes upon 
whom to pin a percentage responsiblity. Nor am I, naturally, “ concerned to 
dispute . . . that differences in ‘ measured intelligence’ are in any appreciable 
degree due to genetic-influences." I just do not take sides in this unreal battle 
Sir Cyril will have to look elsewhere for an environmentalist to Serve him as ai 
opponent. X 

Iam surprised that he still regards positive test-correlations as implying a 
concrete factor of general ability. Has it not been pointed out many times that 
the general factor which emerges depends upon how the data fed into the machine 
are selected d If the psychologist arranges the test-material in accordance with a 
concept of ' intelligence dominant within his culture-pattern he will obtain a 
general factor consistent with that concept, and, of course, he will identify it there- 


* BERG, J. M., and Kırman, B. H. (1960). “Th t s vin.’’ it 
J.. June 25th, pp. 1911-17, ) e mentally defective twin.” Brit. Med. 
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with. The ‘general factors ’—and, indeed, much that we ordinarily regard as 
evidence of ‘ intelligence '—could be equally seen as a function of motivation and at 
one remove, of the mental skills which are also a function of motivation. 

I disagree also with Sir Cyril Burt's time scale. Foetal damage by environmental 
insult was demonstrated in a mammal only in 1937 by Hale and in man by Gregg 
in 1941. A definite genetic-environmental interaction was demonstrated only in 
1949 by Andersen and in 1954 in more detail by Fraser. In so far as the earlier 
geneticists used the term ' facilitation ' it was in quite a different sense, namely, the 
facilitation of the action of one gene by another gene. 

I would not agree that unfavourable uterine environment is only rarely responsi- 
ble for mental defect, and have in fact produced evidence (1957, 1959) to show that 


some 70 per cent. of mental subnormality in normal-appearing children may be 
thus caused. 


As regards mongolism, again the time-scale : 
first report of an extra chromosome in 1959. It has not been “ amply shown that this 
particular defect results from extra chromosomal material." Results to date, while 
indisputable, are fragmentary : we do not yet know w] 
nor the chromosomal constitution of their p 
anomaly arises. With little doubt anom: 


pre-conditions for mongolism, but it is epistemologically naive to jump to the 
conclusion that is it the cause, 


It must not be forgotten that a lar, i 
: a ge proportion 
(Hertig) of early embroyos are malformed, probably because of ERA error. 
Normally, these do not develop y i of mongols and other 
genetically anomalous malformations i ir 
out seems to fail. I have awaiting publication eviden is i 
; a ce that this is due to maternal 
stress (hormonal disturbance ? domi ; 
ke ) pre ominantly at the second and third months of 


D. H. Srorr. 


Turpins actually published the 


i T MAIN REFERENCES 
REGG, N. McA. (1941). C i i 
Trans. Een on e. ded d aput following German measles in the mother. 

SNORRA D. ise p Effect of diet during pregnancy upon the incidence of congenital 
C DH ^1957)* MERO hernia in the rat." Amer. J. Path. 25, 163-84, © 

; D. H. . Mental and physi Amer. J. Path., 25, 163-84. — 
rory PeRmancy, Lancet, 171, i 100g. ^| handicaps in the child following a disturbed 
ToTrT, D. H. (1959). Evidence for pre. i ; s 

retarded children. Vita Humane 2, 12s aren of temperament in mentally 


CRITICAL NOTICE 


THE CROWTHER AND ALBEMARLE REPORTS 


Fifteen to Eighteen: Report of ihe Central Advisory Council for Education. 
H.M. Stationery Office, 1959. Vol. I, pp. xxxii+-519, 12s. 6d. 


The Youth Service in England and Wales: Report of the Committee Appointed 
by the Minister of Education. H.M. Stationery Office, 1960, pp. iv-- 136, 6s. 


By CYRIL BURT 


Four years ago the Minister of Education invited the Central Advisory Council, 
of which Sir Geoffrey Crowther is chairman, “ to consider, in relation to the changing 
social and industrial needs of our society and the needs of its individual citizens, 
the education of boys and girls between the ages of 15 and 18." Their Report 
presents a critical survey of the present system, and devotes special attention to the 
Secondary modern schools, the sixth forms in the grammar schools, and the various 
types of further education at present available. The inadequacies which their 
investigations have disclosed demand, so the Council contends, a methodically 
planned educational programme covering the next twenty years. 

‘Their essential recommendations may be summed up as follows : (i) a compul- 
sorv extension of the school leaving age to 16 by some date between 1966 to 1969 ; 
(ii) a series of experiments which will lead to a scheme of compulsory part-time 
education, to be instituted during the 1970's, for all youths and girls of 16 and 17 who 
are not then receiving full-time education; (iii) further experiments, leading to the 
introduction of external examinations below the level of the G.C.E., organized on a 
regional rather than national basis, for pupils in modern schools during their fifth 
year; (iv) immediate measures to forestall premature or excessive specialization in 
grammar schools; (v)a closer integration of the training given in school with that 
provided by the schemes of further education, based largely on a more coherent 
national system of education in the widest sense ; (vi) the substitution of two school 
leaving dates for three ; and (vii) last but by no means least, more effective steps to 
meet the urgent need for improving both the quantity and the quality of teachers. 
The total cost of all these recommendations (estimated by adding the amount that 
will be lost in production to the amount directly entailed by the new provisions) is 
put at 7200-250 million a year. But, it is argued, this rather heavy burden must be 
regarded as “ a profitable investment in national efficiency.” More than half the 
extra expenditure will be the effect of raising the school leaving age ; and it is with 
this particular item in the Report that most of the discussions in the popular press 
have been concerned. 

Some of the most interesting findings of the Council are contained in a second 
and supplementary volume. They include the results of three factual surveys 
sponsored by the Council, namely (a) a social survey, dealing more particularly vii th 
the home background and employment record of school leavers and with factors 
affecting the age of leaving school; (b) a national service survey, dealing with the 
distribution of latent ability, and with the occupations, earnings, and leisure 
activities of recruits, and (c) a survey of technical courses. The Report itself starts 
with the question: “ Who gets full-time education? " The figures obtained from 
recruits for the services indicate that 72 per cent. of them left school at 15 or earlier ; 
however, about two-fifths of those aged 15 to 17 got some kind of further education, 
if only for one day a week. But what to the psychologist will seem es cially: 
regrettable is the number of intelligent youths and girls who had failed te secure 
any kind of beor education commensurate with their capacities | 
p, het analysis the Council adopt the " Army measure of ability groups .” 
based : : gence tests and yielding a conventional classification 
into six groups or grades : namely (if we convert the percentages to terms of an 
I.Q. scale with an S.D. of 15): (i) high ability (the top 10 per cent. i.e., over 120 
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1.Q.) ; (ii) moderately high ability (the next 20 per cent., i.e., over 108 1.Q.) ; (iii) 
good average (100 to 108 L.Q.) ; (iv) poor average (92 to 100 1.Q.) ; (v) moderately 
low (under 92 .I.0.); (vi) very low (under 80 I.O.). Of the men in the highest 
group 9 per cent. left school at 15 and 33 per cent. at 16; while 58 per cent. stayed on 
Still longer, usually leaving at 18. But most of the latter were drawn from the 
professional, managerial, and clerical classes. Among the sons of the manual 
Workers, 19 per cent. of those in the highest ability group left at 15 and 44 per cent. at 
16, while only a quarter stayed on till 18. Of the men in the next highest ability 
group 65 per cent. left at 15, 22 per cent. at 16, and only 7 per cent. stayed on till 
18. All this seems at first sight to imply a gross wastage of talent ; and it is largely 
to rectify this loss that the Report recommends the compulsory raising of the school 
leaving age to 16. 

The reasons advanced in support of this proposal are threefold : (i) “ Leaving at 
15 is so deeply embedded in certain parts of the social structure that it is unlikely 
that this waste of talent can be remedied without compulsion.” ; (ii) “ The demand 
for more educated workers is growing at almost every level of industry, and raising 
the age to 16 would give those near the bottom a better foundation ^ ; (iii) " The 
strongest part of the case is the need for a secondary education for all ; which will 
extend through the important and difficult period of adolescence." 

The suggestion that the education of every child should be prolonged until the 
process of maturation is completed forms unquestionably a highly attractive ideal. 
But the real question is whether such an ideal can indeed be achieved by a change 
which is to be carried out through the Schools, and is imposed by compulsion, and 
will, after all, add only one further year, Moreover, it is a question which turns on 
matters of psychological fact rather than of ethical or sociological theory Unfortun- 
ately, at present, there is very little factual evidence available either one way or the 
other. The most we can do is to forget for a moment our cherished idez s y lour 
very natural hopes, and weigh up quite impartially the somewt ag sle: = pie Wn 
ties which attach to the various 5g somewhat slender probabili 
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economical and prudent policy is surely to make tentative experiments in these 
new directions first of all. 

Some information on the problem may, perhaps, be drawn from the experience 
already gained by raising the leaving age from 14 to 15. Recent investigators assure 
us that the extra year of education has had little effect on the educationally sub- 
normal or on the degree of illiteracy and semi-illiteracy in the general population. 
The prophecied reduction in delinquency has certainly not been realized. Before 
the age was raised, the duller 13-year-olds, as most school teachers will recall, were 
far more prone to petty delinquency both in school and out of it; and when at 14 
they at last gained a longed-for sense of self-dependence and came under the new 
discipline of industry, the number of offences declined. In 1947, however, as the 
Report very frankly admits, '' when the school leaving age was raised to 15, there was 
an immediate change " : the age of 14 became the '' peak age for juvenile delin- 
quency .. . and has held that place consistently ever since." But perhaps the most 
convincing comments come from the class teachers who will have to shoulder the 
burden of these duller groups. Visit the secondary modern schools in any of the 
larger industrial areas, and ask the masters who have had charge of the top age- 
group in the C streams: you will find that, almost without exception, they are 
highly sceptical about the advantages that are predicted. They will, moreover, put 
their finger on a further drawback which the Council have apparently overlooked. 
Not only are these unwilling youngsters quite impervious to instruction and control, 
but their very presence in the classroom renders the task of teaching their more 
willing school-fellows far more arduous and ineffectual. 

This, however, is a criticism of the schools as they are organized at the present 
day. Hence, it is not to my mind wholly conclusive. Most of those who have 
discussed the proposal seem to have forgotten a relevant piece of past history. For 
pupils who are definitely subnormal, the compulsory extension of the school period 
would be by no means a complete innovation. Under the older Education Acts 
children who were certified as mentally deficient and transferred to special schools 
were obliged to attend until they were 16. Those of us who are old enough to recall 
the magnificent work done by the schools for elder boys and elder girls are therefore 
tempted to inquire whether the experience then gained might not be profitably 
exploited once again for the same type of child who would now be grouped with the 
' educationally subnormal.’ Might it not even be extended to those who are some- 
what above what was formerly the borderline for certification ? There is one strong 
point in its favour. Before the special schools were introduced, mental deficiency was 
commonly cited as an important cause of juvenile delinquency ; by 1920 it had 
become, at any rate in this country, one of the least important. f 

However, psychologists are by profession trained to distrust armchair deduc- 
tions and to demand a series of first-hand experiments before they pronounce any 
final verdict. Accordingly, they at any rate, should surely press for an actual trial of 
these new proposals on a limited and inexpensive scale, before the change is made 
universal. This was, in fact, the way in which the special schools I have just des- 
cribed were actually introduced. Under the earlier Acts, which were permissive 
rather than obligatory, one or two of the more enterprising authorities tried out a 
pilot scheme first of all. Let me add that what perhaps are most urgently needed 
are experimental studies of the various types of practical training that are feasible 
under classroom conditions to see whether any of them are likely to be really effective 
and, if so, which types of training suit which types of adolescent boys and girls. 

Nevertheless, the school alone cannot do all. Much in the attitude of the 
unwilling pupils is just a second-hand reflection of the attitude adopted by his 
family and his friends. The Advisory Council, as we have seen, thinks that the 
parental objections spring chiefly from the fact that "leavingat 15 is deeply embedded 
in the social structure." However, “ leaving at 15 " is too recent a change to have 
become part of the working class tradition. The parents of the present generation, 
like the grandparents before them, left school at 14 or even earlier. In these families, 
therefore, both sons and daughters are assumed to ‘ come of age ’ on their fourteenth 
birthday, that is, directly they have reached what the lawyers call actis plena 
pubertatis, and are, thereby, transformed from ' children’ to ` young persons.’ By 
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then, itissupposed, they are sufficiently mature to become financially independent 
of their parents, and old enough to start contributing to the family income. To-day, 
no doubt, most of the parents could easily afford to support their youngsters for a 
much longer period then was possible a generation ago; and many are already 
beginning to realize that the youth who has profited by a further spell of education 
will be far more likely to secure a good position in the industrial world than if he 
had left school at the earlier age. The demand for better educated employees is 
rapidly increasing. Already there is an urgent need, not merely for more technolog- 
ists of high intelligence, but also for more technicians of moderate or average ability. 
And for those who look forward to such posts an extra year or so of schooling is 
unquestionably an asset. 

But the rise in economic standards has not only sent up the incomes of the 
parents ; it has led to an increase in the earnings of the young, which is tantamount 
to a domestic revolution. Many a youngster at 15} is not only able to keep himself, 
but his weekly wage packet endows him with a lavish purchasing power which he is 
quite unfitted to use intelligently. And it is the allurement of these high earnings 
that induces so many of them to leave as soon as the law permits. But, it seems 
extremely doubtful whether a new Act of Parliament which forced such youngsters 
to remain for another year against their will and against the wishes of their parents 
would prove an effective remedy. Already, the number of.those who are sensible 
enough to stay on has begun to increase; and, as they and their parents appreciate 
the advantages to be gained, and as the schools themselves remodel their curricula 


so as to provide a more attractive and more suitable training, it will continue to 
increase at an accelerated rate. 
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these reasons the Committee resolved at the outset to issue its recommendations 
with the utmost speed, and was therefore unable to undertake any extensive surveys 
or researches of its own. So far as they go, the proposals put forward will be 
generally welcomed. How far they can be successfully implemented, and whether 
they will fulfil all that the Committee hope for, are issues that time alone can decide. 

The Report opens with an instructive review of the history, scope and limitations 
of the Youth Service, as set up in 1939, and then goes on to consider in some detail 
the changing pattern of adolescent life. The adolescents of to-day are taller, heavier, 
and more mature than their parents were at the same chronological age. During the 
last half century the average gain in height has been about }-inch per decade and 
1 pound in weight. Most boys and girls now reach puberty well before the age of 15. 
These changes, it has been suggested, may be in part the result of an underlying 
genetic change. During the preceding centuries there was a considerable amount of 
inbreeding. With the advent of the railway, and still more of the motor car, the 
motor bus, and the aeroplane, geographical isolation has become a thing of the past ; 
and within the present century social barriers have been gradually broken down. As 
a result outbreeding has started to take the place of inbreeding. And it is well 
known that in both animals and plants such a change in the mating system is 
frequently followed by what is known as ‘ hybrid vigour.’ However, pending 
further research on this and allied problems, it will probably be safer to assume that 
more obvious causes are chiefly responsible. Early maturation is by no means such 
a novel phenomenon as is commonly supposed. “The age of puberty was quite as 
early 300 years ago: and there is ample evidence to show that, with the change to 
urban conditions that accompanied the industrial revolution, the rate of growth was 
progressively slowed down. To-day, with better food, improved social conditions, 
and greater opportunities for physical exercise and recreation in the fresh air, we 
seem now to be reverting to the more normal growth-rate of our ancestors. 

To meet the altered conditions of adolescent life the Committee suggests that the 
Minister of Education should “ initiate a ten-year development programme divided 
into two stages." The first should consist in what they term a period of “ blood 
transfusion," that is, a stage of special short-term measures, supported by emergency 
grants, to meet the more immediate needs, particularly those of the impending 
bulge. The second phase, we are told, should consist in “ à sustained and continuous 
nourishment "' of the several services involved. To supervise and organize the nec- 
essary schemes a new advisory council should be appointed, to be called the Youth 
Service Development Council. The objects of the Service, it is emphasized, are not 
(as is so often supposed) ‘‘ merely negative—a mere means for ‘ keeping young peiple 
out of trouble ' or ‘ off the streets’: rather they must be positive and constructive.” 
The Report sums them up under three main heads: first to provide associations 
of the right sort; secondly, to supply a training suited to the varying needs of the 
young people concerned ; and thirdly, to furnish opportunities for self-expression 
and self-development adapted to the interests and abilities of different individuals 
so that both youths and girls may have a chance to display, and to respect, pre- 
eminence in other fields than the merely academic, instead of being content with the 
cruder forms of pleasure and excitement prompted by their own primitive impulses 
or with the more sophisticated forms devised by those who find it profitable to ex Toit 
Los ME emotions with the aid of modern mass-media and other cofiüme cis 
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Most of these activities will require and furnish their own type of discipline. 
And, it is to be hoped, throughout the Youth Service the ultimate goal will be to 
achieve what in the language of the old-fashioned pedagogue would be called 
' internal discipline ' rather than ‘ external discipline,’ and to secure it by ' indirect 
methods ' rather than ‘direct.’ In the past, the avowed educational and moral 
aims which formed the raison d'etre of such clubs proved their undoing. After all 
how many of us would join a club whose professed object was to improve its members' 
characters? Even the word ‘leader’ has a suspicious ring. The adolescents of 
to-day are sheep who strongly resent a shepherd. Consequently, those who are most 
in need of the benefits which the youth services can provide tend to stay outside. 
Too often, it would seem, the Report seems to have forgotten that it is dealing with 
inexperienced teen-agers and Teddy-boys, who, though fully adult in body, have a 
mentality corresponding to a Terman-Binet age of barely 14. Half of them will 
never exceed the intelligence of children of 12 or 13 or even less. But, as the 
Committee remark at the end of their recommendations, they do not profess at this 
juncture to offer guidance in regard to details: “ Much needs to be learnt in the 
light of further experience." And the difficulties we have mentioned will doubtless 
be closely watched by the new Development Councii. 

Although each of the Reports contains many minor points on which both the 
practical teacher and the psychological theorist will be tempted to proffer criticism, 
they have between them accomplished a most useful task. They have uncovered a 
series of urgent problems which parliaments, governments, and local bodies have so 


far lacked the courage, or the power, or the necessary knowledge to attack on 
systematic lines with the necessary financial aid ; and they have brought to light a 
number of deeper issues of which the general public has hitherto been wholly 
unaware. 
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Journal of Child Psychology and Psychiatry and Allied Disciplines (ed. by C. B. 

Hindley, E. Irvine and E. Miller). London: Pergamon Press, Vol. 1, 

No. I, Jan., 1960 ; to appear quarterly. Annual subscription £5 5s. (private 

individuals), £7 (libraries and institutions). 

We welcome the launching of yet another new journal in this country which, 
while likely to overlap but slightly with the British Journal of Educational Psychology, 
certainly merits the attention of our readers. Its aim is to include contributions 
from any discipline which further the understanding of children. However, the 
content of the first issue suggests that it will be primarily concerned with abnormal- 
ities of childhood, five out of eight authors (or pairs of authors) being psychiatrists, 
and two others clinical psychologists. Perhaps one of the most valuable outcomes 
will be the co-operation that it brings about between the medical and psychological 
approaches to children. 

Unfortunately, the Journal is expensive, though a considerable reduction is 
made to members of the Association of Child Psychology and Psychiatry. Apart 
from a somewhat excessive width of line, the format, type and paper are attractive. 

PEV 


ANDRY, R. G. (1960). Delinquency and Parental Pathology. Methuen, pp. xv-+ 
173; 21s. 


Many readers will already have heard of Dr. Andry's recent investigation, which 
showed that an unsatisfactory relationship between a boy and his father is more apt 
to be associated with delinquency than the widely publicized factor of maternal 
deprivation. This volume is useful in describing his methods, his delinquent subjects 
and controls, his oral questionnaire, and his quantitative results, in full; (a slight 
criticism might be raised that it is too full, since there is rather a lot of repetition). 
Among the most differentiating items were the boy's feeling that his mother loved 
him more, few activities or interests shared with the father, and failure to turn to 
the parents when in trouble ; much absence of the father from home showed a slight 
association, but not the mother working. That these answers do not merely represent 
the delinquent's attitudes was checked by asking much the same questions of both 
parents in à sub-sample; there was close accord in about 80 per cent. of cases. 
A very consistent picture of the delinquent's conception of his father's role is revealed 
and Dr. Andry shows that this fits in both with Freudian, and with modern Learning, 
theory. At the end there is a useful short survey of the relevant literature. 


CuurcHMAN, C. W., and Ratooss, P. (editors) (1959). Measurement: Defini- 
tions and Theories. New York: Wiley and Sons ; London : Chapman and 
Hall; pp. 274, 64s. 


The December, 1956, meeting of the American Association for the Advancement 
of Science included a five-part symposium on measurement in various branches of 
science. The contributions to this symposium have been edited and presented as 
discussion on the various views held at the moment by workers de j 
fundamental measurement. Sections in this book deal with meani 
of measurement and measurement in the physical and soci 
mathematical arguments included in the contribution by C. H. Coombs on the 
‘Inconsistency of preferences as a measure of psychological distance,” might be a 
useful reference to psychologists working in this very specialised section of | Ak Ta 
metrics. In this experiment, four psychology students were presented s a oile 
of twelve grey colours (ranging from almost white to almost black) in sets of four at 
a time. It was found that each individual's concept of ' erevness ' w i E 
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Cote, L. (1959). Psychology of Adolescence. New York: Rinehart, pp. xviii- 
731, $7.00. 


This is the fifth edition of a book which originally appeared in 1936. It is 
easy to see why this has been such a popular book : it is comprehensive, readable and 
helpful. There are sections on Physical, Intellectual, Emotional, Social and Moral 
development, and in each case, after presenting a detailed account of normal boys 
and girls, Dr. Cole describes the characteristics of deviates, incorporating the 
results of much recent research. Case studies, diagrams and statistics are widely 
employed and appendices suggest books, films, problems and projects. 

Although the book is intended for all who have to deal with adolescents (and 
its publishers claim it has been « widely used as a reference by guidance counsellors, 
doctors, and parents, and by adolescents, themselves ”), it is clear that the writer is 
thinking primarily of teachers, and it is teachers and students in training who would 
find her work particularly stimulating. Throughout the book, she brings out 
admirably the relevance of psychological considerations to the classroom situation. 


Part Six, “ Educational Applications " deals with topics of perennial importance, 
such as “ Emotional Problems in the Schoolroom,” “ The Social Life of the School," 
“ The High School Curriculum." 


Possibly Dr. Cole is most at ease with brilliant children. She does not distinguish 
sufficiently clearly between dull, backward, retarded and maladjusted—in fact, she 
uses ' retarded ’ in its more popular sense and ignores the children whose educational 
attainment is not in keeping with their intellectual ability whether their L.Qs. are 
low, average or high. 

An English reader might be tempted to suggest that ‘ 
Adolescence " would make a more accurate title. Throughout the book, Dr. Cole 
generalises on evidence, research and reading drawn almost entirely from American 
Sources. In the preface, she points out that the li 
extensive. It is a pity that among so many hun 
include at least a few from outside the States, 
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take place will, in the author's opinion, “ depend on the optimism of the observer 
as well as on the possibility of developing means of treatment which will be more 
effective than those at present in use." 

In a short chapter on education, the authors admit that they have “ no sig- 
nificant information about this aspect of the situation and make no report." Never- 
theless, it is obvious that they have given a good deal of thought to many of the 
difficulties which commonly beset those who are responsible for establishing and 
implementing special schemes. 

"Throughout the book runs a thread of warning that, since the parent-child 
relationship inevitably changes as each grows older, in special educational treatment 
as in family life, the devising of appropriate degrees of pressure and concession is 
one of the most difficult tasks in dealing with children whose development deviates 
from normal. 

In conclusion, the authors state their belief that, to persist in the same sort of 
special treatment, whether educational or physical, which may have been effective 
during stages of rapid development, may be very inadvisable in later years. 

The book is not entirely discursive: of the eighteen chapters, eight are 
devoted to classification, aetiology, pathology and treatment. The text is well 
illustrated with tables, figures and photographs, and a useful bibliography lists over 
170 other authors to whose work reference is made. M. I. Dunspon. 


Dienss, Z. P. (1959). Concept Formation and Personality. Leicester University 
Press, pp. xi4-82, 18s. 


This book describes the basis and conduct of an investigation into likely 
relationships between some aspects of concept formation in the mathematical field 
and certain personality characteristics as represented by teachers' ratings of their 
pupils. The early discussion offers an interesting analysis of the ability to engage in 
mathematical activity, distinguishing, on the one hand, the acquisition of concepts, 
and, on the other, the processes whereby these are manipulated to generate new and 
more complex ideas. It is suggested that individuals acquire and develop concepts 
in their own ways and not according toastereotyped pattern. The actual processes 
of concept formation, as used in the later experiments, are developed from the work 
of Lewin, Eysenck and Rokeach. The latter part of the book describes and analyses 
an experiment in concept formation, along Piagetian lines with a mixed group of 
one hundred ten-year-old children. 

Children have to learn Mathematics as it is taught in school. Moreover, con- 
cepts of number are basic to later more abstract work. After so interesting a 
preliminary discussion, it is, therefore, disappointing to find an experiment in which 
the normal school Mathematics has no place. Such an experiment is removed from 
both the physical and the subject context of the teaching situation, and hence two 
of the most important factors in the learning situation. In addition, the acquisition 
of such operations as used here does not correspond to what is expected of children 
in straightforward mathematical work. It is not known whether the ability to 
acquire and use these operations in this type of experimental situation would transfer 
into the corresponding situation involving mathematical elements. Unlike Piaget's 
work, this experiment seems to be concerned with the acquisition and use of a 
certain type of mathematical operation in a non-mathematical situation. This has 
probably no more relation to the learning of Mathematics than to any other subject 
field. 

The ideas expressed in the theory are widely realised amongst teachers of 
Mathematics, despite the author's comments in this respect. The book is valuable 
in that it formalises these and brings them into perspective. But the language of 
the book is too complex for many teachers. D. M. LEE. 


GALANTER, E. (edit.) (1959). Automatic Teaching—The Stat ihe Art 
York: John Wiley; London: Chapman and Hall; m ak 198. as E 


Earlier automatic teachers, such as books, have suffered from their inability to 
answer back, but the latest models do so quite easily ; for if the reader answers 
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‘A’ to a problem, he must turn to page ‘ X,’ and if he answers ' B ’ to page ‘ Y, and 
so on, the process being repeated “until the dullest person has got all the answers 
right, and the brightest has pressed on through volume after volume. Alternatively, 
we can use ‘ chemo cards,’ on which the right answers change colour, but the wrong 
answers do not, or we can construct elaborate machines. As Jacob Beck, oné of the 
authors of this book says: “ Traditional teaching devices such as books and lectures 
are almost completely uncontrolled, The discriminative stimuli are not explicitly 
specified by these devices . . . Many a difficulty in teaching can be traced to students 
' focusing ' on the wrong stimuli and attaching responses to them. Furthermore, 
texts and lectures do not specify the particular response which it is desired that an 
individual acquire. Since they do not generally require the student to make any 
response, little if anything can be known of the student's responses to the content 
presented . .. The rules which guide a student's responses are also indefinite . . . 
Reinforcement under these conditions is also a haphazard matter—depending upon 
accidental self-administration.” 

Apart from the help that such machines can obviously give the self-taught, 
most teachers will view mechanical aids with suspicion. Nevertheless, there is no 
doubt that it would benefit most teachers to have to program some of their lessons 
on these machines, for it becomes necessary to list the sequence of operations that an 
ideal teacher must perform, and to develop, in the process, a general theory of learn- 
ing and teaching. We are faced, for example, with such questions as, ' What is the 
correct order of presentation of material? . . . Is there an optimum number of errors 
that should be made ? . . . Should prompts be used, and, if so, how many, and what 
Tr QM MA paring problem —experimenter-controlled rate vs. self-pacing ; 

p z Problem . . . whether one program is satisfactory for all students 
and the constructive vs. multiple choice Tesponse problem. Many of the basic 
principles of learning theory have to be re-examined. For instance, is it reasonable to 


suppose that an aperiodic schedule may enhance the inherent interest i j 
matter, as it seems to do in animal experiments ? waitin aes 
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increased and brought up to date and some topics which were only sketchily treated 
in the previous edition have been accorded fuller treatment, for example, cerebral 
palsy, certain clinical types of mental deficiency and heart conditions. The material 
has been re-arranged into four main sections: mental deviates (including gifted 
children) ; physical disabilities (defective vision, hearing and speech and orthopaedic 
defects) ; neurological impairments and health problems (brain dam 


age, epileptics, 
delicate children) ; emotional and social problems (maladjustment and delinquency). 
The book gathers together a good 


deal of basic information about the nature 
and causes of handicaps, summarises the psychological characteristics of exceptional 
children and relates these to the broad lines of education and treatment. It would, 
therefore, have value as a source of reference in college libraries and as an outline 
of the whole field for experienced teachers taking courses of training in the education 
of handicapped children. (It was written for this purpose in American courses in 
Special education.) Students specialising in a particular handicap would find little 
in the discussion of their own field which they would not get more fully and ade- 
quately elsewhere. Moreover, the value of the book for British students is consider- 
ably reduced by the almost complete lack of reference to British work and reports, 
and there are the inevitable differences arising from different terminology and 
methods of organising special education. 

These two objections apply even more to the book by Magnifico which, be: 
largely a discussion of policy and organisation of special education in the American 
setting, is chiefly of interest in showing how similar problems are being discussed and 
tackled in another country, We should, of course, be familiar with and ready to 
learn from systems which differ from our own but, apart from that, this book has 
limited value for the British reader. Two chapters on gifted children will be of 
general interest. The failure of American education to giv 
developing the most gifted children is discussed in relation 
social attitudes which have impeded progress in this field. 
reaction to the first earth satellite is seen as an opportunit 
these barriers. 
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GLOVER, E. (1959). The Roots of Crime. London : Imago, pp. xiii--422, 45s. 


Dr. Glover, an outstanding authority on Freud and criminology, presents us 
with a lucid textbook which will rank high in its chosen field. It consists mainly of 
reprinted essays and lectures dating from 1922 to the present time. With faith and 
consistency, he has spread his concepts, at first almost alone, and from the 
nineteen-thirties, together with a handful of equally eminent colleagues (for instance 
the late Dr. Dennis Carrol) through the Portman Clinic, the Institute for the Study 
and Treatment of Delinquency, and since 1950, through the British Journal of 
Delinquency, of which he is co-editor with Dr. Hermann Mannheim and Dr. E. Miller 

His strong message can perhaps be best evaluated under two headings : the 
psycho-dynamics of criminal behaviour, and how social action can best be secured 
in the interest of both the criminal and of society. Both views inevitably arouse the 
kind of controversy to be expected when a leader pronounces on the social sciences 
which, as yet, are full of mysteries in comparison with already existing knowled: e 
in the field of the normative sciences. Ultimately, the Controversy is based ent 
on whether the pronouncements are merely hypotheses or are likely to Berdo E 54 
hard facts. ; s P into 

He stands on controversial ground, as he admits, when he a TS 
that he has “ no great faith in the fertility of existing forms of ‘ combines Er 
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and of society’s reaction to it: the existence of the super-ego as the seat of the 
' unconscious conscience,’ its guilt, and its unconscious need for punishment in an 
endeavour to relieve guilt. Through committing offences, the criminal unconsciously 
seeks to be punished, and, in turn, not only indirectly punishes himself, but also 
society. Society cries out for punishment of the criminal, because its members do 
not understand that their desire for vengeance stems from their own unconscious 
guilt feelings, which a convenient criminal scapegoat can diminish. 

Given this insightful hypothesis, what is the cure? Dr. Glover makes many 
interesting comments, ranging from the suggestion to psycho-analyse individuals, 
especially the recalcitrant but not incurable psychopath, along orthodox Freudian 
(and not Kleinian) lines, to setting up, through the Home Office, a project for a 
screening service in schools, in order to detect, register and treat, all cases of pre- 
delinquency. Dr. Glover's important book contains many fertile ideas which should 
keep a generation of serious students in this field busy proving him wrong or right. 


R. G. ANDRY. 


Harwos, P. (edit.) (1959). Papers on the Teaching of Personality Development. 
The Sociological Review, Monograph No. 2. Keele: University College of 
North Staffordshire, pp. 149. Price, 12s. 

Following the issue of the first monograph of The Sociological Review, in 
1958, which dealt with the teaching of rere lity Aaa du students of 
education and social work, this, the second monograph, deals with further problems 
arising from the teaching of personality development. It represents the Conference 
on Personality Development at the University of Leicester in April, 1959. In the 
introduction, Professor W. J. H. Sprott Says that the papers of the first monograph, 
representing the 1958 conference at Keele, were concerned with theoretical matters, 
whereas the papers of this, the second conference, are severely practical in character. 


The list of contributors to this volume, wi ; E. poteg : 
J. H. Nicholson, who writes Ron with the topics dealt with, is as follows : 


en s j Students' attitudes, past and present; 
Clare W pape who deals with the development of insight ; G. p. Meredith, whose 
ie en achers and social workers; J. E. Richardson, who 
describes the exploration of teacher and pupil roles in a training house ; H. F. Powell 
: y development; Kay F. McDougall, who 

ee P , n tis eer methods and problems of identification ; Mia L 
= ayes de ey fa X the teaching of personality development in a diploma 
Sane A is Bg Vi ; Norah Gibbs, who reflects on the conferences at 
an ; and the contributions are brought to a close by Paul Halmos, 
issues of professional education. eaching dynamic psychology and the moral 
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excellent progress, though one feels some doubts about the method of calculating 
‘acceleration.’ For example, a child who had reached a Reading Age of 6-0 after 
five years at an ordinary school and 8-0 after two years’ attendance at the clinic, is 
claimed to have increased his rate of progress five times. 

A novel feature of the Report is that good progress in reading (though not in 
arithmetic) was maintained for several months after discharge. The pamphlet also 
contains a useful discussion of the educational difficulties of the emotionally dis- 
turbed child, and of the place of such a remedial unit in schools or homes for the 
maladjusted. 


Knicut, R., and Knicut, M. (1960). A Modern Introduction to Psychology. 
(Sixth edition, revised). University Tutorial Press, pp. 272, 9s. 


The Sixth Edition, Revised, of this extremely concise and admirably lucid 
introduction to psychology is to be welcomed. The first five chapters have been 
thoroughly revised as have the chapters on “ Animal Learning " and “ Instinct." 
The chapter on “ Brain and Mind ” has been re-written to give more adequate 
insight into the semantics of the topic. The Notes on Reading have been brought 
up to date and some new footnotes have been added. 

The balance of the book remains weighted in favour of physiological psychology, 
which takes up about half the text. The remainder is given over to Intelligence, 
Learning, Personality, Instinct and Psycho-analytic Theory. It might well have 
been more thorough to have included a chapter on '' Social Psychology." This 
could have replaced the chapteron “ Dreams " which seems the least satisfactory 
chapter. 

Good though this introductory book is, it may be more suitable for students of 
psychology than for students of educational psychology. Nevertheless, it should be 
included in their recommended reading because of the use the authors make of 
relevant research and the clarity with which they write. P. H. TAYLOR. 


LEANING, P. A. (1959). The Challenge of Cerebral Palsy. Auckland, N.Z. : 
Whitcombe and Tombs, pp. 98, no price given. 


The author intends this book for teachers and parents of cerebral palsied child- 
ren. For those coming newly to the field, it will provide a useful introduction. 
Miss Leaning gives a concise account of the physical and educational handicaps of 
cerebral palsied children, and discusses the remedial methods employed. The main 
part of the book is devoted to educational problems and methods, and there are also 
chapters on speech, occupational and physio-therapy, as well as vocational guidance 
and training. Before writing this book, Miss Leaning made a world-wide tour of 
cerebral palsy centres, and so she is able to illustrate her points with brief accounts 
of some of the different educational approaches she observed, and with the comments 
and opinions of those she met. 

Those who are familiar with this field will probably gain most from Miss 
Leaning’s account of the effect of the physical handicap on the child’s personality 
She is herself cerebral palsied, and so is able to bring to her very objective treatment 
of this subject, the insight gained from her own experience. She points out how the 
effect of the physical handicap extends to many unsuspected facets of the child’s life 
and emphasizes the importance of parents’ and teachers’ attitudes in avoidin a 
feeling of hoplessness in the cand. 8? 

Readers of this book already working for cerebral palsied childr i 
Miss Leaning's plea for a greater exchange of pi ee in this Held. : M aem 
only regret that she does not give more detailed information j 
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year juniors. It closely resembles a Moray House Verbal Reasoning test with 
deviation-quotient norms, though in fact, it does not register below I.Q. 78 at 94 
years. It was standardised on some 4,500 children in the Midlands. The one- 
hundred items are of conventional type, and 45 minutes are allowed. It is preceded 
by a practice sheet taking about 15 minutes, but thereafter, the children have to 
read the instructions and items for themselves. The author admits its dependence 


on reading, and prefers to regard it as a measure of ‘ scholastic aptitude ' rather 
than ' intelligence." 


Monsin, S. M. (1959). Practical Handbook of Guidance in Secondary Schools. 
Bihar, Patna: Superintendent Secretariat Press, pp. 169, Rs. 2.25. 


The Bihar State Bureau of Educational and Vocational Guidance, under the 
directorship of the author, appears to have set up an interesting cross between the 
American system of school Counselling and the British system of youth employment 
service. In each secondary School a teacher is given 6 months' training in the theory 
and practice of counselling. He then becomes responsible for selecting 13-- pupils 
for transfer from elementary schools. He keeps records and applies attainments 
tests during their first. year as a basis for allocating them to one of the various 
secondary courses—science, arts, technical, agricultural, etc. "Throughout the 
school life, he is available for consultation on scholastic and personal problems, and 
is responsible for providing occupational information at the appropriate time. 
Chen, before leaving age at 17--, he has complete records of the school career and 


background, and can apply interests and aptitude te: á : x m 
American sources by the Bureau) ptitude tests (mostly adapted fro 


: Finally, he refers the leaver to the Youth 
Rae l; A s re You 
mipya x om ot the Employment Exchange, with his recommendations. 

t e ae pm ec i how completely this System operates, but it would certainly 
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poe mn ale TREES masters. This volume is a working textbook for 
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or earlier; in others the children have themselves adapted an old expression or 
rhyme to fit a new situation. Even links with the U.S.A. are revealed. 

The book is more likely to be useful to the sociologist than to the psychologist, 
though the latter may profit from noting the prevalence of compulsive-obsessions in 
presumably normal children. I was glad to add to my own collection of these the 
reference to General Gordon's habit of zig-zagging on a stone pavement to avoid 
treading on the cracks. 

Certainly, this volume would make at least an excellent bed-side book for the 
child psychologist. C. W. VALENTINE. 


Peters, R. S. (1959). Authority, Responsibility and Education. Allen and 
Unwin, pp. 137, 12s. 6d. 


“ One of the most awe-inspiring features of academic institutions is the tyranny 
of fashions," as Dr. Peters wryly observes. Any book that leads its readers to 
examine the roots of current educational beliefs and practices is, therefore, salutary, 
whether the reader accepts the author’s findings or not. 

Dr. Peter’s book, based mainly on radio talks given between 1956 and 1959, 
does just this. Some of the educational and philosophical concepts he scrutinises 
are age-old, others are of more recent appearance, popular derivations of Marx 
and Freud, but all concern fundamental educational issues. 

This radical approach leads him not only to question common assumptions, but 
sometimes to make a case for their opposite. He suggests, for instance, that far 
from being cleverer than he is moral, twentieth-century man has so far not shown 
enough intelligent foresight to ensure the maintenance of old morality in new social 
conditions. He also suggests that the psychology of learning could benefit more 
from a study of the practice of education than the teacher can at present from learn- 
ing theory. 

But Dr. Peters builds most of his case for the latter on the mechanistic learning 
theories of Hull and Skinner, and omits more recent work on learning in its genetic 
and social setting (except that of Piaget) which in the experience of this reviewer some 
teachers find both relevant and useful. Here Dr. Peters falls into the same sort of 
See as he has in previous parts of the book been spotlighting and castigating in 
others. 

Nevertheless, the book makes lively reading, and it is recommended to lecturers 
in Education, not as a complete statement of the case on any of the points raised, 
but as a stimulating point of departure for seminar discussion groups. 


Cora TENEN. 


Pats, B. N., Duxe, L. D., and De Vaurr, M. V. (1960). Psychology at 
Work in the Elementary School Classroom. New York: Harper, pp. xii 
395. No price given. 


Both in its method and its content this is an unusual approach to the te; 
of educational psychology. Essentially, it consists of a series of reprints of recent 
research articles dealing with aspects of child behaviour in classroom Situations 
Each article is preceded by a brief survey of the wider field wt " 
the particular research together with a selected list of referenc 
the underlying psychological principles. The application of the: 
tional problems is emphasised. The papers are well-chose 
enquiries carried out mainly in America during the past ten years. They make use 
of a wide range of experimental methods and statistical treatments, and are classified 
Ssroom Organisati - 
personal Relationships " and “ Pupil Behaviour and Learning " i apices 

On the assumption that transfer from research findin 
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Psychology and statistics to make much of the research papers. It is true that 
many of the papers have a “ Summary and conclusions "' paragraph at the end but 
this, by itself, would have little value. There Seems, in fact, to be only one way of 
making research findings useful to teachers, and that is to teach them much more 
Psychology and statistics during their initial and in-service training, so that they can 
at least read with understanding the publications which have a bearing on their 


craft. L. B. Biren. 


Pressey, S. L., ROBINSON, F. P., and Horrocks, J. E. (1959). Psychology in 
Education. New York: Harper, pp. xi+658, $6.50. 

Macponatp, F. J. (1960). Educational Psychology. San Francisco: Wads- 

worth Publ. Co., pp. Xix-4-748, no price given. 

The first of these books, written by three distin 


University, brings up to date, without greatly extending, a work written by Pressey 
in 1933. Tt is of broad coverage wi i 


without being overloaded with detail and includes 
relevant research up to 1957. 

It departs from the Conventional approach in treating development as a process 
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STOGDILL, R. M. (1959). Individual Behaviour and Group Achievement: A 
Theory—The Experimental Evidence. New York: Oxford University 
Press, pp. 352, 40s. 


The author has already published a large number of researches on sociometry, 
leadership and group relationships during the past ten years and this book reports 
a theory which he has developed in the course of his close association with the 
topic of social groups. He describes an organised group as an input-output system 
in unstable balance, and proceeds to analyse the Structure of the ' input" and 
'output' variables. He assumes that the minimum aspects of behaviour necessary 
to describe ‘input’ are the performances, interactions and expectations of the 
members of the group. The last, which would not necessarily be allowed by other 
theorists, is a variable which is associated with the measurement of the purpose of the 
group ; formally he defines ‘ expectation ' as " readiness for reinforcement, and is a 
function of drive, estimated probability of a Possible outcome and the estimated 
desirability of this outcome." He comments on the obvious links between this 
definition and modern learning theory. He also criticises a tendency of some 
writers to confuse ‘ performance '—an individual variable, with ‘ interaction ’—a 
group variable. These three variables, in combination, “ account for the develop- 
ment of group structure and for the initiation and maintenance of group operations.” 
These, in turn, result in ‘ output ’ or achievement, in which he includes not only 
productivity, but morale and integration. He has an interesting concept cf group 
morale as “ the degree of freedom Írom restraint exhibited by a group in operating 
on a goal objective." These three dimensions of output are not necessarily inde- 
pendent, 

Within this Íramework, a substantial part of the book is devoted to a detailed 
examination of each of the input and Output variables and their relationships both 
to each other and to the internal working and development of the group. In each 
case the theory is followed by an examination of available research findings for 
confirmatory evidence. The lack of evidence from experiments scientiücally 
designed to test the various aspects of the theory is a weakness which the author 
himself realises. He states, in fact,“ .., Some may doubt that the same experiment 
can lend equal Support to various theories and sub-theories." The connection 
between the research reported and theory is often tenuous and the thread in the 
author's mind often tends to be obscured by the enormous amount of detail he 
includes, 

In discussing ‘ interaction ’ he attempts to find ground common to his theory 
of social groups and the mathematical theory of groups. This is not explicitly 
developed, and its introduction perhaps seems a little pretentious, Nevertheless, 
for the advanced student of group theory the book is likely to be of use on account 
of its wide coverage of existing research in the form of abstracts, as well as the 
collected bibliography. It would have made its points more cogently if the author 
had made some effort to vary the method of presentation with the aid of tabu 


S lations 
and charts, 


D. M. EDWARDS PENrorp, 


TANsLEY, A. E., and GULLIFORD, R. (1960). The Education 
Children. Routledge and Kegan Paul, PP. viii-4-255, 28s 
This book will be invaluable to all teachers in E.S,N. Schools, and it has much to 

Offer to those concerned with backward classes Or groups in ordinary Schools, A 

large amount of practical guidance is given on methods of helping the very backward 

in language and reading, number, arts and crafts, and general social competence : 
and many ingenious ways are described of enlisting their interest and building up 
favourable attitudes after a long history of failure and 8 up 
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sensible position on intellectual deficiency, bringing out the value (though also the 
limitations) of intelligence measures, while admitting that LOs. are often lowered 
by unfavourable environment and may alter, and avoiding the fallacies of the 
Achievement Quotient. The third shows how the needs of E.S.N. children differ 
but little from those of the average child, though they are frequently characterised 
by emotional immaturity. The fourth is a useful compendium of information on 
the common physical deficiences often associated with intellectual backwardness. 
Apart from the chapters on teaching, perhaps the most original section of the 
book is the last one which discusses ways of preparing the E.S.N. leaver for a vocation 
and for life as a self-supporting adult. p.E.V. 


TIBBLE, J. W. (1960). Conference Report. London: New Education Fellowship, 
pp. 84, 5s. 6d. 


This is a brief report on the conduct of an international Conference held at 
Utrecht in 1956, where most of the work was done in discussion groups under 
experienced group leaders. A questionnaire was circulated eighteen months later, 
and a majority of the members who answered seem to have had favourable recollec- 
tions. The useful effects of the procedure and occasional weaknesses are carefully 
analysed and discussed. Curiously, the report seems to have nothing to say as to 
whether members learnt anything fresh about the ostensible topic of the Con- 


ference—Constructive Education and Mental Health in Home, School and Com- 
munity. 


— (1959). They Steal for Love. (London: Max Parrish, pp. 132, 


d This Posie describes an experiment in Education and Psychiatry with children 
and parents. Facilities were made available under which a small number of disturbed 
children could be brought together to live in a community which was nearly as 
pM : orma] ofie in the generally accepted sense of that term. The community 
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Wootton, BARBARA (1959). Social Science and Social Pathology. London : 

George Allen and Unwin, pp. 400, 35s. 

This book offers us a Social Scientist's analysis of present-day views and 
attitudes on anti-social behaviour and all that they imply. The first part of the 
book contains a useful review of research on the relation between criminality and 
various social and economic factors, e.g., truanting, broken home, poverty, etc., and 
critically examines the validity of certain currently popular hypotheses. 

One of these is the theory of maternal separation as a causative factor in crime. 
Lady Wootton, after reviewing the contradictory evidence, concludes that all that 
has been shown is that children need dependable love, something which she, not 
unnaturally, regards as a homely truth known for centuries. The main usefulness 
of the theory, she claims, lies in throwing light on “ deplorable patterns of institu- 
tional upbringing." Is this not both an over-simplification of the theoretical 
position as regards the fundamental early relationships and an underestimate of the 
practical consequences the wide acceptance of the theory has had—which would not 
have come about without ? 

The second part deals with changing contemporary attitudes towards social 
deviants, so deeply coloured by the psychiatric approach. While Lady Wootton 
welcomes the humanising influence of this trend, this part—and indeed, the book— 
is largely devoted to a critical exploration of the limitations and fallacies of 
psychiatric theories. (Psychologists are presumably included among the “ psychia- 
trically-orientated satellites,’’) ` 

The book renders a service in drawing attention to the comparatively subjective 
nature of the definitions of mental health and mental illness (the author has amassed 
an impressive list of them), yet it also seems to exaggerate their inadequacy, Lady 
Wootton assumes that mental illness is generally diagnosed merely on the basis of 
delinquent behaviour. When reviewing present practice, she writes i 
true to say that the illness is the behaviour for which it is also the excuse ” (page 225). 
She here ignores the possibility that an unresolved conflict within the self may be 
regarded as being the illness and that this is diagnosed by a syndrome of particular 
dys-functions, not all of which are of a de nquent nature. j 

In an excellent chapter on “ Mental Disorder and Criminal Responsibility,” 
she explores with relentless logic the difficulties involved in arriving at clear 
criteria for diminished responsibility in law other than the intellectual criteria of the 
McNaughten Rules, while still retaining the concept of responsibility in any form. 
She herself seems to favour the policy of by-passing the question of responsibility 
and deciding the kind of ‘ treatment ` to be meted out to the criminal by assessing 
what will help most to modify his attitudes. 

: Part III of the book contains the conclusions on methodology, where Lady 
Wootton favours carefully controlled research with modest aims ( prediction ' 
rather than ' causation '), and practical conclusions which are less clear-cut, She 
prefers the environmental to the individual approach to social problems because she 
fears that the present emphasis on individualistic study will divert attention away 
from the need for environmental action, in fact, that psychological treatment will 
become the new opium of the masses. This view makes her Sceptical of Child 
Guidance Clinics. She considers that it would be more logical to modify the ré ime 
in the normal educational system rather than to create new institutions to deal wrth 
its misfits. Dealing with misfits, one feels, far from covers all the functions of z 
Child Guidance Clinic, one of which surely is precisely to guide and help schoole + 
adopt a psychologically sound approach to children. F SNB iu 

Throughout the book there runs a note of regret 
the medical empire in these fields, and, indeed, 
defects of social and interpersonal developme 
necessity be the prerogative of medically qu 
Ian Suttie and Professor Aubrey Lewis seem 
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ZANGWILL, O. L. (1960). Cerebral Dominance and its Relation to Psychological 
Function. Published for the William Ramsey Henderson Trust. Edin- 
burgh : Oliver and Boyd, pp. vii+-31, 10s. 6d. 


In the earlier part of the book Professor Zangwill presents evidence suggesting 
that in left-handed, or predominantly left-handed individuals, the dominant 
hemisphere as usually ascertained is more often the left than the right. Furthermore, 
from the work of Luria, also that of Subirana, it seems that patients who are left- 
handed, or have left-handed relatives, are more likely to have complete recovery 
from aphasia than dextral patients. On the other hand, the evidence also suggests 
that language mechanisms are less fully lateralized in sinistrals than dextrals, an 
in the majority of left-handed individuals speech mechanisms are likely to be dis- 
turbed by an appropriate located lesion in either cerebral hemisphere. 

During the latter parts of the work the author examines the findings from twenty 
cases of backwardness in reading and spelling in which there was no evidence of 
primary mental subnormality or gross brain damage. The results suggested to him 
that there is a prima facie case for linking dyslexia and kindred disorders with a 
sinistral tendency in the individual or in the family. Zangwill makes it clear, of 
course, that left-handedness and ambidextrality as such are not causal factors in the 
aetiology of dyslexia. But he does propose, from his own findings, and from those 
of other workers, that some left-handers, some who are ambidextrous, and some right- 
handed individuals who have sinistral antecedants or a covert sinistral tendency, 
may possess a type of cerebral organisation which is particularly vulnerable to the 
effects of stress. Thus, in these cases reading and other language functions may 
easily be disturbed by environmental conditions, 

To the reviewer it seems that the clear opinion of such distinguished workers as 
Penfield and Roberts that speech mechanisms depend upon the integrity of the left 
hemisphere in almost all cases, regardless of handedness, gives rather more emphasis 
to the importance of the left hemisphere than does the evidence marshalled in 


Table 3. But Professor Zangwill is well aware of this from his remarks at the bottom 
of page 9. . 

This book is easy to read and should be Studied by all who are concerned with 
backwardness in reading. 


K. LOVELL. 
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